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NOVEL POLYKETIDE DERIVATIVES AND RECOMBINANT 
METHODS FOR MAKING SAME 

This application is a continuation-in-part of co-pending U.S. Serial No. 08/858,003, 
filed May 16, 1997 which is a continuation-in-part of co-pending U.S. Serial No. 07/642,734, 
filed January 17, 1991. 

Technical Field 

The present invention relates to novel polynucleotide sequences, proteins encoded 
therefrom which are involved in the biosynthesis of polyketides, methods for directing the 
biosynthesis of novel polyketides using those polynucleotide sequences and novel derivatives 
produced therefrom. In particular, the invention relates to the production of novel polyketide 
derivatives through manipulation of the genes encoding polyketide synthases. 

Background of the Invention 
Polyketides are a large class of natural products that includes many important 
antibiotic, antifungal, anticancer, antihelminthic, and immimosuppressant compounds such as 
erythromycins, tetracyclines, amphotericins, daunorubicins, avermectins, and rapamycins. 
Their synthesis proceeds by an ordered condensation of acyl esters to generate carbon chains 
of varying length and substitution panem that are later converted to mature polyketides. This 
process has long been recognized as resembling fatty acid biosynthesis, but with important 
differences. Unlike a fatty acid synthase, a typical polyketide synthase is progranrmied to 
make many choices during carbon chain assembly: for example, the choice of " starter" and 
" extender" units, which are often selected from acetate, propionate or butyrate residues in a 
defined sequence by the polyketide synthase. The choice of using a full cycle of reduction- 
dehydration-reduction after some condensation steps, omitting it completely, or using one of 
two incomplete cycles (reduction alone or reduction followed by dehydration) is additionally 
programmed, and determines the pattem of keto or hydroxy 1 groups and the degree of 
saturation at different points in the chain. Finally, the stereochemistry for the substituents at 
many of the carbon atoms is programmed by the polyketide synthase. 

Streptomyces and the closely related Saccharopolyspora genera are producers of a 
prodigious diversity of polyketide metabolites. Because of the commercial significance of 
these compounds, a great amount of effort has been expended in the study oi Streptomyces 
and Saccharopolyspora genetics. Consequently, much is known about these organisms and 
several cloning vectors and techniques exist for their transformation. 

Although many polyketides have been identified, there remains the need to obtain 
novel polyketide structures with enhanced properties. Current methods of obtaining such 
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molecules include screening of natural isolates and chemical modification of existing 
polyketides, both of which are costly and time consuming. Current screening methods are 
based on gross properties of the molecules, i.e. antibacterial, antifungal activity, etc., and both 
a priori knowledge of the structure of the molecules obtained or predetermination of 
enhanced properties are virtually impossible. Chemical modification of preexisting structures 
has been successfully employed to obtain novel polyketides, but still suffers from practical 
limitations to the type of compoimds obtainable, largely connected to the poor yield of 
multistep synthesis and available chemistry to effect modifications. Modifications which are 
particularly difficult to achieve are those involving additions or deletions of carbon side 
chains. Accordingly, there exists a considerable need to obtain molecules wherein such 
changes can be specified and performed in a cost effective manner and with high yield. 

The present invention solves these problems by providing reagents (specifically, 
polynucleotides, vectors comprising the polynucleotides and host cells comprising the 
vectors) and methods to generate novel polyketides by de novo biosynthesis rather than by 
chemical modification. 

Summary of the Invention 
In one aspect, the present invention provides compounds of the formula: 



O 




Re 

X 

wherein Ri, R2, R3, R4, R5, and R6 are independently selected from Q wherein Q is selected 
firom the group consisting of (a) -H, (b) -Me, (c) -Et, and (d) -OH; R7 is selected firom the 
group consisting of -Et, -HOMe, and 13-3,4-dihydrocyclohexylmethyl; Li and L2 are 
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independently -H or -OH; L3 is D-desosamine or -OH; and L4 is L-mycarose, L-cladinose or 
-OH with the proviso that when R7 is -Et and R1-R5 are -Me, R6 is other than -H or -Me. 
Preferred compounds of the invention are those in Q is selected from the group consisting of 
(a), (b) and (c) above or (a), (b) and (d) above or (a), (c) and (d) above or (b), (c) and (d) 
5 above or (a) and (b) above or (a) and (c) above or (a) and (d) above or (b) and (c) above or (c) 
and (d) above, R7 is -Et and Li, L2, L3, and L4 are as defined above. Other preferred 
compounds include those in which Ri, R2, R3, R4, R5, and R6 are all -H or -Et or -OH, R7 is 
-Et and Li, L2, L3 and L4 are as defined above. Still other preferred compounds include 
didesmethyl, tridesmethyl, tetradesmethyl, pentadesmethyl, and hexadesmethyl derivatives of 
1 0 the compounds of formula X and particularly, di- tri-, tetra-, penta-, and hexadesmethyl 

derivatives of erythromycins A and B. Other especially preferred compounds of formula X 
include 6,10-didesmethyl-6-ethylerythromycin A, 10,12-didesmethyl-12-deoxy-12- 
ethylerythromycin A, 10,12-didesmethyl-12-deoxy-10-hydroxy erythromycin A, 6,10,12- 
tridesmethyl-6, 1 2-diethylerythromycin A, 6, 1 0,12-tridesmethyl-6-deoxy-6,l 2- 
15 diethylerythromycin A, 1 0-desmethylerythronolide B, 1 0-desmethyl-6-deoxyerythronolide B, 
12-desmethylerythronolide B, 1 2-desmethyl-6-deoxyerythronolide B, 12-desmethyl-12- 
ethylerythronolide B, 6-desmethyl-6-deoxy-6-ethylerythronolide B, 10- 
desmethylerythromycin A, lO-desmethyl-12-deoxy erythromycin A, lO-desmethyl-6,12- 
dideoxy erythromycin A, 12-desmethylerythromycin A, 12-desmethyl-12-deoxyerythromycin 
20 A, 12-desmethyl-6,12-dideoxyerythromycin A, 6-desmethyl-6-ethylerythromycin A, 12- 
desmethyl-12-ethylerythromycin A, 12-desmethyl-12-deoxy-12-ethylerythromycin A, 10- 
desmethyl-lO-hydroxy erythromycin A, 12-desmethyl-12-epihydroxyerythromycin A, 10,12- 
didesmethylerythromycin A, 10,12-didesmethyl-12-deoxyerythromycin A, 10,12- 
didesmethyl-6,12-dideoxy erythromycin A, 1 0-desmethylerythronolide B, lO-desmethyl-6- 
25 deoxyerythronolide B, 12-desmethylerythronolide B, 12-desmethyl-6-deoxyerythronolide B, 
1 0-desmethy lerythromycin A, 1 0-desmethy 1- 1 2-deoxy erythromycin A, 1 0-desmethyl-6, 1 2- 
dideoxy erythromycin A, 12-desmethy lerythromycin A, 12-desmethyl-l 2-deoxy erythromycin 
A, 1 2-desmethy 1-6, 12-dideoxy erythromycin A, 1 0,1 2-didesmethy lerythromycin A, 10,12- 
didesmethyl- 1 2-deoxy erythromycin A, and 10,12-didesmethyl-6,12-dideoxyerythromycin A. 
30 Most preferred compoimds include 1 0-desmethy lerythromycin A, 1 0-desmethy 1-12- 
deoxyerythromycin A, 12-desmethyl-l 2-deoxy erythromycin A, 8-desmethyl-8- 
hydroxyerythromycin A, 6-desmethyl-6-epierythromycin A, 4-desmethyl-4- 
hydroxyerythromycin A, 2-desmethyl-2-hydroxyerythromycin A, 13-desethyl-13- 
hydroxymethol erythromycin A, 2, 12-didesmethyl-2,12-dihydroxy erythromycin, 4,10- 
3 5 didesmethyl-4, 1 0-dihydroxy erythromycin, 1 0, 1 2-didesmethy 1- 1 0-hydroxyerythromycin, 
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6, lO-didesinethyl-6-ethyl-lO-hydroxy erythromycin A, and 13-desethyl-13-(3',4'- 
dihydroxycyclohexyl)methylerythromycin A. 

In another aspect, the present invention provides an isolated polynucleotide sequence 
or fragment thereof which encodes an enzymatically active acyltransferase domain from a 
5 PKS selected from Streptomyces hygroscopicus, Streptomyces venezuelae, and Streptomyces 
caelestis. Preferably, the polynucleotide sequence is SEQ ID NO:l, SEQ ID NO:2, SEQ ID 
NO:29 or SEQ ID NO:30. In another preferred embodiment, the polynucleotide sequence 
encodes an acyltransferase domain selected from the group consisting of SEQ ID NO:3 1 , 
SEQ ID NO:32, SEQ ID NO:33 and SEQ ID NO:34. 

10 The present invention also provides a vector comprising a polynucleotide sequence or 

fragment thereof which encodes which encodes an enzymatically active acyltransferase 
domain from Streptomyces. Preferably, the polynucleotide sequence is selected from those 
described above and the Streptomyces is Streptomyces hygroscopicus^ Streptomyces 
venezuelae^ or Streptomyces caelestis. A particularly preferred vector is pCS5. Other vectors 

15 of the invention include pUC 1 8/LigAT2, pEryATl /LigAT2, pEryAT2/LigAT2, 

pUC18/venAT, pEryATl/venAT, pUC19/rapAT14, pEryATl/rapAT14, pEryAT2/rapAT14, 
pUC/5'-flank/ethAT, pUC/ethAT/C-6, pEAT4, pUC18/NidAT6, pEryAT2/NidAT6. 
pEryATs/NidAT6 and pEryATs/rapligase 3.0. 

In another aspect, the invention provides host cells transformed with a vector as 

20 described above. The host cell may be a bacterial cell and preferably is selected from the 

group consisting of E. coli and Bacillus species. Alternatively, the host cell is a polyketide- 
producing microorganism. A preferred polyketide-producing host cell is selected from the 
group consisting of Saccharopolyspora species, Nocardia species, Micromonospora species, 
Arthrobacter species, Streptomyces species, Actinomadura species, andDactylosporangium. 

25 species. An even more preferred polyketide-producing host cell is selected from the group 
consisting of Saccharopolyspora hirsuta, Micromonospora rosaria^ Micromonospora 
megalomicea, Streptomyces antibioticus^ Streptomyces mycarofaciens, Streptomyces 
avermitiliSj Streptomyces hygroscopicus^ Streptomyces caelestis, Streptomyces tsukubaensis, 
Streptomyces fradiae, Streptomyces platensis^ Streptomyces violaceoniger, Streptomyces 

30 ambofaciens, Streptomyces griseoplanus, and Streptomyces venezuelae. Of these host cells, 
Saccharopolyspora erythraea^ Streptomyces hygroscopicuSy Streptomyces venezuelae, and 
Streptomyces caelestis are most preferred. 

The invention also provides a method for altering the substrate specificity of a 
polyketide synthase in a first polyketide-producing microorg2uiism comprising the steps of 

35 (a) isolating a first and second genomic DNA segment, each comprising a polyketide 

synthase wherein the first genomic DNA segment is from the first polyketide-producing 
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microorganism and the second genomic DNA segment is from the first polyketide-producing 
microorganism or a second polyketide-producing microorganism; 

(b) identifying one or more discrete fragments of the first genomic DNA segment, 
each of which encodes an acyltransferase domain; 

5 (c) identifying one or more discrete fragments of the second genomic DNA 

segment, each of which encodes a related domain to the acyltransferase domain of the first 
genomic DNA segment; and 

(d) transforming a cell of the first polyketide-producing microorganism with one 
or more of the fragments fi*om step (c) under conditions suitable for the occurrence of a 

10 homologous recombination event, leading to the replacement of one or more of the fragments 
from the first genomic DNA segment with one or more of the fragments from step (c). In one 
embodiment, the first polyketide-producing microorganism is Saccharopolyspora erythraea 
and the second polyketide-producing microorganism is Streptomyces. Preferred 
Streptomyces are selected firom the group consisting of Streptomyces antibioticus, 

1 5 Streptomyces mycarofaciens, Streptomyces avermitilis, Streptomyces hygroscopicus^ 

Streptomyces caelestis, Streptomyces tsukubaensis, Streptomyces fradiae^ Streptomyces 
platensis, Streptomyces violaceoniger^ Streptomyces ambofaciens, and Streptomyces 
Venezuelan Even more preferred Streptomyces are Streptomyces caelestis, Streptomyces 
hygroscopicus, or Streptomyces venezuelae. In a second embodiment, the first polyketide- 

20 producing microorganism is a Streptomyces as described above and the second polyketide- 
producing microorganism is Saccharopolyspora erythraea. Also in a preferred embodiment, 
the related domain is selected from the group consisting of SEQ ID NO:3 1 , SEQ ID NO:32, 
SEQ ID NO:33, and SEQ ID NO:34. 

25 Brief Description of the Drawings 

The present invention will be more readily appreciated in connection with the 

accompanying drawings. 

FIG. 1 is a proposed metabolic pathway for the biosynthesis of erythromycin A in 

Sac. erythraea, 

30 FIG. 2 is a schematic representation of the erythromycin PKS. 

FIG. 3 is a Growtree analysis of AT domains fi'om Streptomyces hygroscopicus (S. 
hygroscopicus; LigAT2 and rapATl-14), Streptomyces venezuelae {S. venezuelae; venAT) 
and Saccharopolyspora erythraea (Sac. erythraea; eryATl-6). 

FIG. 4a is a schematic representation of gene replacements of EryATl with LigAT2 
35 or venAT and EryAT2 with LigAT2 in Sac. erythraea. 

FIG. 4b is a schematic representation of gene replacements of EryAT4 with an ethyl 
AT (NidATS) in Sac. erythraea. 
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FIG. 5 is a diagrammatic representation of gene replacement by homologous 
recombination. 

FIG. 6 is a schematic representation of the genetic organization of the Ligase-PKS 
cluster from iS. hygroscopicus ATCC 29253. 
5 FIG. 7 represents the nucleotide sequence (SEQ ID NO: 1 , top strand) and 

corresponding amino acid sequence (SEQ ID N0:31, bottom strand) of LigAT, the malonyl 
AT domain from module 2 of the Ligase-PKS cluster of S, hygroscopicus ATCC 29253. 

FIG. 8 is a diagrammatic representation of the strategy to clone the LigAT2 domain. 

FIG. 9 is a flow diagram depicting the cloning of the EryATl flanking regions in 
10 plasmidpCS5. 

FIG. 10 is a flow diagram depicting construction of pEryATl/LigAT2. 

FIG. 1 1 is a flow diagram depicting the cloning of the EryAT2 flanking regions in 
plasmid pCS5. 

FIG. 12 is a flow diagram depicting construction of pEryAT2/LigAT2. 
15 FIG. 1 3 represents the nucleotide sequence (SEQ ID NO:2, top strand) and 

corresponding amino acid sequence (SEQ ID NO:32, bottom strand) of venAT, the malonate 
AT domain from the PKS cluster (hereinafter designated pven4) from 5. venezuelae ATCC 
15439. 

FIG. 14 is a diagrammatic representation of the strategy to clone the venAT domain. 
20 FIG. 1 5 is a flow diagram depicting construction of pEryATl/venAT. 

FIG. 16 is a diagrammatic representation of the strategy to clone the rapAT14 domain. 
FIG. 17 is a flow diagram depicting construction of pEryATl/rapAT14. 
FIG. 1 8 is a flow diagram depicting construction of .pEryAT2/rapAT14. 
FIG. 19 is a schematic representation of the genetic organization of the PKS cluster 
25 from Streptomyces caelestis NRRL-282 1 . 

FIG. 20 is a diagram of the structure of the macrolide ring of niddamycin. 
FIG. 21 represents the nucleotide sequence (SEQ ID NO:29, top strand) and 
corresponding amino acid sequence (SEQ ID NO:33, bottom strand) of NidATS, the ethyl AT 
domain from module 5 of the PKS cluster of Streptomyces caelestis NRRL-282 1. 
30 FIG. 22 is a flow diagram depicting the construction of pUC/ethAT/C-6. 

FIG. 23 is a diagram showing the nucleotide changes made to create dSiAvrll site at 
the 5* end of NidATS. 

FIG. 24 is a diagram of the replacement plasmid pEAT4. 
FIG. 25 represents the nucleotide sequence (SEQ ID NO:30, top strand) and 
35 corresponding amino acid sequence (SEQ ID NO:34, bottom strand) of NidAT6, the AT 
domain in module 6 of the niddamycin PKS cluster. 
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FIG. 26 is a diagrammatic representation of the strategy to clone the NidAT6 domain. 
FIG. 27 is a flow diagram depicting construction of pEryAT2/NidAT6. 
FIG. 28 is a flow diagram depicting the cloning of the EryATs flanking regions in 
plasmid pCS5. 

5 FIG. 29 is a flow diagram depicting the construction of pEryATs/NidAT6. 

FIG. 30 is a flow diagram depicting the cloning of pEryMl/NidAT6. 
FIG. 31 is a flow diagram depicting the construction of the plasmid 
pSLll 80/rapligase 3.0. 

FIG. 32 is a flow diagram depicting construction of the plasmid pEryATs/rapligase 

10 3.0. 

DETAILED DESCRIPTION OF THE INVENTION 

1. Definitions: 

15 For the purposes of the present invention as disclosed and claimed herein, the 

following terms are defined: 

The term " polyketide" as used herein refers to a large and diverse class of natural 
products including but not limited to antibiotic, anticancer, antihelminthic, antifimgal, 
pigment, and immunosuppressant compounds. Antibiotics include but are not limited to 

20 anthracyclines, tetracyclines, polyethers, polyenes, ansamycins, and macrolides of various 
types such as avermectins, erythromycins, and niddamycins. The term polyketide is also 
intended to refer to compounds of this class that can be used as intermediates in chemical 
syntheses. For example, erythromycin A is a polyketide that is isolated and used in the 
synthesis of the antibiotic clarithromycin. Polyketides used as intermediates do not 

25 themselves necessarily have any biological or therapeutic activity. 

The term "polyketide-producing microorganism'* as used herein includes but is not 
limited to bacteria from the order Actinomycetales, Myxococcales or other Eubacteriales that 
can produce a polyketide. Examples of actinomycetes and myxobacteria that produce 
polyketides include but are not limited to Saccharopolyspora erythraea^ Saccharopolyspora 

30 hirsuta, Micromonospora rosariOy Micromonospora megalomicea^ Sorangium cellulosum^ 
Streptomyces antibioticus^ Streptomyces mycarofaciens^ Streptomyces avermitilis, 
Streptomyces hygroscopicus, Streptomyces caelestis, Streptomyces tsukubaensis, 
Streptomyces fradiae^ Streptomyces platensiSy Streptomyces violaceoniger, Streptomyces 
ambofacienSy Streptomyces venezuelae and various other Streptomyces^ Actinomadura^ 
35 Dactylosporangium and Amycolotopsis strains that produce polyketides. Yeast and fimgi that 
produce polyketides are also considered " polyketide-producing microorganisms" . Examples 
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of fungi that produce polyketides include but are not limited to members of the genus 
Aspergillus. 

The term "polyketide synthase" (PKS) as used herein refers to a complex of enzyme 
activities responsible for the biosynthesis of polyketides. The enzymatic activities contained 
5 within a PKS include but are not limited to P-ketoreductase (KR), dehydratase (DH), 
enoylreductase (ER), p-ketoacyl AC? synthase (KS), acyl carrier protein (ACP), 
acyltransferase (AT) and thioesterase (TE). The polypeptide fragment responsible for each 
enzymatic activity is referred to as a " domain" . A " module" refers to a group or set of 
domains which carry out one condensation step in the process of polyketide formation and 
10 may or may not include domains which effect processing of the p-carbonyl group in the 
growing polyketide. 

The term "Type I PKS" as used herein refers to a PKS which is a large 
multifunctional protein and is exemplified by DEBS (see below). The term "Type II PKS" 
refers to a PKS having several separate, largely monofimctional enzymes, and is exemplified 
15 by the PKSs responsible for the biosynthesis of actinorhodin and tetracenomycin (C.R. 
Hutchinson and I. Fujii, Annu. Rev. Microbiol. 49:201-238 (1995)). 

The term " cognate domains" as used herein refers to the members of a specific set of 
domains which constitute a naturally occurring single module. 

The term " related domain" or " heterologous domain" as used herein refers to a PKS 
20 domain which is functionally similar to a second PKS domain. By " functionally similar" it 
is meant that each domain catalyzes a particular type of reaction but acts upon a different 
substrate. For example, the AT domain of module 1 of Sac. erythraea (eryATl) and the AT 
domain of module 14 of S. hygroscopicus (rapATH) both catalyze the transfer of an 
extender unit to a corresponding ACP domain. In the case of Sac. erythraea^ however, 
25 eryATl utilizes methylmaloriyl Co A as a substrate whereas in S. hygroscopicus^ rapAT14 
utilizes malonyl CoA. Thus, eryATl and rapATH are considered to be "related" or 
"heterologous" domains. 

The term " condensation" as used herein refers to the addition of an extender unit to 
the nascent polyketide chain and requires the action of KS, AT and ACP domains of the PKS. 
30 The term " starter" as used herein refers to a coenzyme A thioester of a carboxylic 

acid which is used by a polyketide synthase as the first building block of the polyketide. 

The term "extender" as used herein refers to a coenzyme A thioester of a dicarboxylic 
acid that is incorporated into a polyketide by a polyketide synthase at positions other than the 
first position. 
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The term "DEBS" as used herein refers to the enzyme 6-deoxyerythronolide B 
synthase, the PKS that builds the polyketide-derived macrolactone 6-deoxyerythronolide B 
(6-DEB). 

The term as used herein refers to the genes which encode the DEBS. 

5 The term "homologous recombination" as used herein refers to crossing over between 

DNA strands containing identical sequences. 

The term " isfolated" as used herein means that the material is removed from its 
original environment (e.g. the natural environment where the material is naturally occurring). 
For example, a naturally occurring polynucleotide or polypeptide present in a living animal is 
10 not isolated, but the same polynucleotide or polypeptide, which is separated from some or all 
of the coexisting materials in the natural system, is isolated. Such polynucleotides could be 
part of a vector and/or such polynucleotides or polypeptides could be part of a composition, 
and still be isolated in that the vector or composition is not part of the natural environment. 

The term "restriction fragment" as used herein refers to any linear DNA generated by 
15 the action of one or more restriction enzymes. 

The term "transformation" as used herein refers to the introduction of DNA into a 
recipient microorganism, irrespective of the method used for the insertion into the 
microorganism. 

The term "replicon" as used herein means any genetic element, such as a plasmid, 
20 chromosome or virus, that behaves as an autonomous unit of polynucleotide replication 

within a cell. A " vector" is a replicon in which another polynucleotide fragment is attached, 
such as to bring about the replication and/or expression of the attached fragment. 

The terms "recombinant polynucleotide" or "recombinant polypeptide" as used 
herein means at least a polynucleotide or polypeptide which by virtue of its origin or 
25 manipulation is not associated with all or a portion of the polynucleotide or polypeptide with 
which it is associated in nature and/or is linked to a polynucleotide or polypeptide other than 
that to which it is linked in nature. 

The term "host cell" as used herein, refers to both prokaryotic and eukaryotic cells 
which are used as recipients of the recombinant polynucleotides £md vectors provided herein. 
30 The term " open reading frame" or " ORF" as used herein refers to a region of a 

polynucleotide sequence which encodes a polypeptide; this region may represent a portion of 
a coding sequence or a total coding sequence. 

11. The Invention 

35 In its broadest sense, the present invention entails novel polyketides with therapeutic 

activity (e.g. antimicrobial, anticancer, antifungal, immunosuppressant and/or antihelminthic 
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activity) and immediate compounds of such polyketides. The invention also provides a 
method for producing novel polyketides in vivo by selectively altering the genetic 
information of an organism that naturally produces a polyketide. The present invention 
further provides isolated and purified polynucleotides that encode PKS domains (i.e. 
5 polypeptides) from polyketide-producing microorganisms, fragments thereof, vectors 
containing those polynucleotides, and host cells transformed with those vectors. These 
polynucleotides, fragments thereof, and vectors comprising the polynucleotides can be used 
as reagents in the above described method. Portions of the polynucleotide sequences 
disclosed herein are also useful as primers for the amplification of DNA or as probes to 
1 0 identify related domains from other polyketide-producing microorganisms. 

III. Polynucleotides 

The present invention provides isolated and purified polynucleotides that encode PKS 
domains (i.e. polypeptides) and fragments thereof which are involved in the production of 

15 polyketides. Polynucleotides included within the scope of the invention may be in the form 
of RNA, DNA, cDNA, genomic DNA and synthetic DNA. The DNA may be double- 
stranded or single-stranded, and if single-stranded may be the coding (sense) strand or non- 
coding (anti-sense) strand. The coding sequence which encodes a polypeptide may be 
identical to a coding sequence provided herein or may be a different coding sequence which, 

20 as a result of the redundancy or degeneracy of the genetic code, encodes the same polypeptide 
as the DNA provided herein. 

Polynucleotides may include only the coding sequence for a particular polypeptide or 
for a polypeptide which is functionally equivalent to the polypeptide sequences provided 
herein. Additionally, the invention includes variant polynucleotides containing modifications 

25 such as polynucleotide deletions, substitutions or additions; and any polypeptide modification 
resulting from the variant polynucleotide sequence. A polynucleotide of the present 
invention also may have a coding sequence which is a naturally occurring allelic variant of 
the coding sequence provided herein. 

Probes and primers constructed according to the polynucleotide sequences provided 

30 herein are also contemplated as within the scope of the present invention and can be used in 
various methods to provide various types of analysis. For example, primer sequences may be 
designed according to polynucleotide sequences which encode particular domains and then 
used to amplify polynucleotide sequences of the same or other related domains using well- 
known amplification techniques such as the polymerase chain reaction (PGR) and the ligase 

35 chain reaction (LCR). (PGR has been disclosed in U.S. patents 4,683,195 and 4,683,202, and 
LCR, in EP-A- 320 308 to K. Backman published June 16, 1989 and EP-A-439 182 to K. 
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Backman et aL, published July 31, 1991, all of which are incorporated herein by reference). 
Generation of primers for use in other amplification techniques or in variations of these 
amplification techniques, (such as nested PGR) is also contemplated within the scope of the 
invention and is considered within the knowledge of the routine practitioner. 

5 Probes and primers may be designed from conserved nucleotide regions of a 

polynucleotide of interest or from non-conserved nucleotide regions of a polynucleotide of 
interest. Generally, nucleic acid probes are developed from non-conserved or unique regions 
when maximum specificity is desired, and nucleic acid probes are developed from conserved 
regions when assaying for nucleotide regions of related members of a multigene family or in 

10 related species. Probes can also be labeled with radioisotopes or other detection labels for 
screening of recombinant libraries. 

Various methods for synthesizing primers and probes are well-known in the art as are 
methods for attaching labels to primers or probes. For example, it is a matter of routine to 
synthesize desired nucleic acid primers or probes using conventional nucleotide 

15 phosphoramidite chemistry and instmments available from Applied Biosystems, Inc., (Foster 
City, CA), Dupont (Wilmington, DE), or Milligen (Bedford MA). Many methods have been 
described for labeling oligonucleotides such as the primers or probes of the present invention. 
Commercially available probe labeling kits include those from Amersham Life Science 
(Arlington Heights, XL), Promega (Madison, WI), Enzo Biochemical (New York, NY) and 

20 Clontech (Palo Alto, CA). 

IV. Vectors and Host Cells 

The present invention provides vectors which include polynucleotides of the present 
invention and host cells which are genetically engineered with vectors of the present 

25 invention. 

a. Vectors and Expression Systems 

The present invention includes recombinant constructs comprising one or more of the 
sequences as broadly described above. The constructs comprise a vector, such as a plasmid 
or viral vector, into which a sequence of the invention has been inserted, in a forward or 

30 reverse orientation. Such vectors include chromosomal, nonchromosomal and synthetic 

DNA sequences from prokaryotic or eukaryotic sources. Large numbers of suitable plasmids 
and vectors are known to those of skill in the art, and are commercially available. Vectors 
which are particularly useftil for cloning and expression in intermediate hosts include but are 
not limited to: (a) Bacterial: pBR322 (ATCC 37017); pGEM (Promega Biotec, Madison, 

35 WI), pUC, pSPORTl and pProExl (Life Technologies, Gaithersburg, MD); pQE70, pQE60, 
pQE-9 (Qiagen); pBs, phagescript, psiX174, pBluescript SK, pBsKS, pNH8a, pNH16a, 
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pNH18a, pNH46a (Stratagene®, La Jolla, CA); pTrc99A, pKK223-3, pKK233-3, pDR540, 
pRIT5, and pGEX4T (Pharmacia®, Piscataway, NJ); and (b) Eukaryotic: pWLneo, pSV2cat, 
pOG44, pXTl, pSG (Stratagene®); pSVK3, pBPV, pMSG, pSVL (Pharmacia®); pcDNA3.1 
(Invitrogen, Carlsbad, CA). Other appropriate cloning and expression vectors for use with 
5 prokaryotic and eukaryotic hosts are described by Maniatis et al. Molecular Cloning: A 
Laboratory Manual , Second Edition, (Cold Spring Harbor Press, N.Y., 1982), which is 
hereby incorporated by reference. Generally however, any plasmid or vector may be used as 
long as it is replicable and viable in a host. 

In another embodiment, the construct is an expression vector which also comprises 
10 regulatory sequences operably linked to the sequence of interest, to direct mRNA synthesis 
and polypeptide production. Regulatory sequences known to operate in prokaryotic and/or 
eukaryotic cells include inducible and non-inducible promoters for regulating mRNA 
transcription, ribosome binding sites for translation initiation, stop codons for translation 
termination and transcription terminators and/or polyadenylation signals. In addition, an 
15 expression vector may include appropriate sequences for amplifying expression. 

Promoter regions may be selected from any desired gene. Particular named bacterial 
promoters include lacZ, gpt, lambda Pr, lambda Pl, trc, trp, ermE and its derivatives such as 
ermEPl TGG, also known in the art as ermE"^, (Bibb, M. J„ et al. Molecular Microbiology^ 
14(3): 533-545 (1994)), melCl and act II {CM. Kao, et al. Science, 265: 509-512 (1994)). 
20 Eukaryotic promoters include cytomegalovirus (CMV) immediate early, herpes simplex virus 
(HSV) thymidine kinase, early and late S V40, LTRs from retroviruses, mouse 
metallothionein-I, prion protein and neuronal specific enolase (NSE). Selection of the 
appropriate promoter is well within the level of ordinary skill in the art. In addition, a 
recombinant expression vector will include an origin of replication and selectable marker 
25 (such as a gene conferring resistance to an antibiotic (eg. neomycin, chloramphenicol, 
ampicillin, or thiostrepton) or a reporter gene (eg. luciferase)) which permit selection of 
stably transformed or transfected host cells. 

In any expression vector, a heterologous structural sequence (i.e. a polynucleotide of 
the present invention) is assembled in appropriate phase with translation initiation and 
30 termination sequences. Optionally, the heterologous sequence will encode a fusion protein 
including an N-terminal identification peptide imparting desired characteristics, e.g., 
stabilization or simplified purification of expressed recombinant product. 

Eukaryotic expression vectors will also generally comprise an origin of replication, a 
suitable promoter operably linked to a sequence of interest and also any necessary translation 
35 enhancing sequence, polyadenylation site, transcriptional termination sequences, and 5* 

flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for 
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example, SV40 origin, early promoter, enhancer, and polyadenylation sites may be used to 
provide the required genetic elements. Such vectors may. also include an enhancer sequence 
to increase transcription of a gene. Enhancers are cis-acting elements of DNA, usually about 
from 10 to 300 bp, that act on a promoter to increase its transcription rate. Examples include 
the SV40 enhancer on the late side of the replication origin (bp 100 to 270), a 
cytomegalovims early promoter enhancer, a polyoma enhancer on the late side of the 
replication origin, and adenovims enhancers. 

i. Vector construction 

The appropriate DNA sequence may be inserted into a vector by a variety of 
procedures. Generally, site-specific DNA cleavage is performed by treating the DNA with 
suitable restriction enzymes under conditions which are generally specified by the 
manufacturer of these commercially available enzymes. Usually, about 1 microgram (|ig) of 
plasmid or DNA sequence is cleaved by 1 unit of enzyme in about 20 microliters (|iL) of 
buffer solution by incubation at 37°C for 1 to 2 hours. After incubation with the restriction 
enzyme, protein can be removed by phenol/chloroform extraction and the DNA recovered by 
precipitation with ethanol. The cleaved fragments may be separated using poly aery lamide or 
agarose gel electrophoresis, according to methods known by the routine practitioner. (See 
Maniatis et al^ supra). 

Ligations are performed using standard buffer and temperature conditions and with a 
Hgase (such as T4 DNA ligase) and ATP. Sticky end ligations require less ATP and less 
ligase than blunt end ligations. Vector fragments may be treated with bacterial alkaline 
phosphatase (BAP) or calf intestinal alkaline phosphatase (CIAP) to remove the 5'-phosphate 
and thus prevent religation of the vector. Ligation mixtures are transformed into suitable 
cloning hosts such as E. coli and successful transformants selected by methods including 
antibiotic resistance, and then screened for the correct construct. 

ii. Transformation/Transfection 

Transformation or transfection of an appropriate host with a construct of the 
invention, such that the host produces recombinant polypeptides, may also be performed in a 
variety of ways. For example, a construct may be introduced into a host cell by calcium 
chloride or polyethylene glycol transformation, lithium chloride or calcium phosphate 
transfection, DEAE-Dextran mediated transfection, or electroporation. These and other 
methods for transforming/transfecting host cells are well known to routine practitioners (see 
L. Davis et al, "Basic Methods in Molecular Biology" , 2nd edition, Appleton and Lang, 
Paramount Publishing, East Norwalk, CT (1994) and D.A, Hopwood et ai. Genetic 
Manipulation of Streptomyces: a laboratory manual. The John Innes Foundation, Norwich, 
England (1985)). 
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b. Host Cells 

In one embodiment, the present invention provides host cells containing recombinant 
constructs as described below. In one aspect, a host cell may be an " intermediate" host 
which is used to produce polynucleotides of the invention on a large-scale basis (for the 
5 purpose of cloning and/or verifying recombinant polynucleotide sequences, for example) or 
as a means to maintain such polynucleotide sequences over time (i.e. as maintenance or 
storage strains). A "production" host is a host cell which is used to produce novel 
polyketides. The host cell (either intermediate or production) can be a higher eukaryotic cell, 
such as a mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or a prokaryotic 

10 cell, such as a bacterial cell. Lower eukaryotic and prokaryotic cells are preferred 
intermediate and production hosts. 

Representative examples of appropriate hosts include bacterial cells, such as E, colU 
Bacillus subtilise Saccharopolyspora erythraea, Streptomyces caelestis, Streptomyces 
hygroscopicus, Streptomyces venezuelae; and various other species within the genera 

1 5 Arthrobacter^ Micromonospora^ Nocardia^ PseudomonaSy Streptomyces^ Staphylococcus, and 
Saccharopolyspora^ although others (of eukaryotic origin) may also be employed. Additional 
representative examples of host cells are polyketide-producing microorganisms (as defined 
above). The selection of an appropriate host is deemed to be within the scope of those skilled 
in the art from the teachings provided herein. 

20 Host cells are genetically engineered (transduced, transformed, transfected, 

conjugated, or electroporated) with the vectors of this invention which may be a cloning 
vector or an expression vector. The engineered host cells can be cultured in conventional 
nutrient media modified as appropriate for activating promoters, selecting transformants, or 
as a source of a biosynthetic substrate. The culture conditions, such as temperature, pH and 

25 the like, are those previously used with the host cell selected for expression, and will be 
apparent to the ordinarily skilled artisan. 

V. Novel Polyketides and Methods of Making Novel Polyketides 

The invention also provides novel polyketides, intermediate compounds thereof, and 
30 methods for producing novel polyketides. The methods utilize the polyketide biosynthetic 

genes from Sac. erythraea (i.e. the eryA genes) as well as those from other known polyketide- 
producing microorganisms and/or putative polyketide-producing microorganisms (i.e. those 
having nucleotide sequences which hybridize to known PKS sequences but whose polyketide 
products are unknown). 

35 The organization of eryA and the DEBS encoded therefrom (see FIG. 1 and FIG. 2) 

have been described in co-pending U.S. application Serial No. 07/642,734, filed January 17, 
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1991, which is incorporated herein by reference in its entirety. As FIG. 2 shows, DEBS is 
organized in modules, with each module being responsible for one condensation step through 
the action of the resident KS, AT and ACP domains within that module wherein an extender 
unit, methylmalonyl CoA, is added first to the starter unit, propionyl CoA, and then 
5 successively to the growing acyl chain. The precise succession of the elongation steps is 

dictated by the order of the six modules: module 1 determines the first condensation; module 
2, the second; module 3, the third, and so on until the sixth condensation step has occurred. 
In addition, the choice of extender unit that is incorporated into a growing polyketide chain at 
each condensation is determined, in whole or in part, by the AT domain within each module. 
10 In the case of DEBS, the extender unit incorporated is always methylmalonate. Thus, as 6- 
deoxyerythronolide B grows through successive condensations, two carbons are added to the 
nascent chain and every other carbon, st2irting with the carbon corresponding to C-12 in the 
ring, carries a methyl group as a side chain. 

As also seen in FIG. 2, the processing of the growing carbon chain after each 
15 condensation is determined by the information within each module. Thus, P-ketoreduction of 
the P-keto group generated by the condensation event takes place after each condensation 
step except the third, as determined by the presence of an active KR domain in each module 
except module 3, whereas dehydration and enoylreduction take place after the fourth 
condensation step, as determined by the presence of the DH and ER domains in module 4. 
20 Once the polyketide chain is fiilly synthesized, it is released from the PKS through the action 
of the TE domain present at the end of module 6 and cyclizes to form the macrocyclic lactone 
6-deoxyerythronolide B which is subsequently acted upon by a series of other enzymes, 
whose genes reside in the erythromycin cluster of the Sac. erythraea chromosome (see FIG. 
1). As shown in FIG. 1, erythromycin carries methyl side chains at position 2, 4, 6, 8, 10 and 
25 12, through the incorporation of methylmalonate as the extender unit at each step of synthesis 
of the polyketide moiety. 

In the present invention, novel polyketide molecules of a desired structure are 
produced by introducing specific genetic alterations into a PKS-encoding sequence in the 
genome of a polyketide-producing microorganism. Alteration of one or more genes or 
30 fragments thereof may be generated through manipulation of genes residing exclusively 

within a species (i.e. intraspecies alterations), and include not only manipulations of genes 
within a single PKS cluster but also between different PKS clusters residing within a single 
strain (as is seen in S. hygroscopicus). Several examples of intraspecies alterations showing 
the manipulation of genes exclusively within a single PKS (namely, eryA) are described in 
35 U.S. application Serial No. Qll&lAJ'iA cited supra. Alternatively, a gene or fragment thereof 
may be exchanged with a heterologous gene or gene fragment encoding one or more related 
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domains from the PKS of a different polyketide-producing microorganism (interspecies 
alterations). Several examples of novel polyketides produced from exchange of heterologous 
genes are provided herein. 

Whether the genetic manipulations are performed intraspecies or interspecies, three 
5 types of alterations to a PKS sequence may be carried out: (i) those which affect a module 
but do not cause the arrest of chain growth (Type I alterations); (ii) those which affect a 
single function in a module thereby causing the arrest of chain growth (Type II alterations); 
and (iii) those which affect an entire module (Type III alterations). In one embodiment. Type 
I alterations are produced by inactivation of domains that specify the functional groups and/or 

10 degree of oxidation found at specific ring positions in the native polyketide. Such domains 
typically include B-ketoreductases, dehydratases and enoylreductases. For example, an allele 
corresponding to B-ketoreductase of module 5 may be mutated by deleting a substantial 
portion of the DNA encoding the B-ketoreductase (thereby producing an inactive domain) and 
used to replace the wild-type allele in the native strain. Such a transfer results in the 

15 production of the novel polyketide 5-oxo-5,6-dideoxy-3-oc-mycarosyl erythronolide B. 

In an alternative embodiment. Type I alterations are generated by replacing at least 
one domain in a particular PKS with at least one related domain from the same or a second 
PKS. Such related domains may exist between different polyketide-producing 
microorganisms (such as for example, the AT domains of Sac. erythraea, S. venezuelae^ S. 

20 hygroscopicus, and S. caelestis) or within a single species (as for example, the LigAT2 and 
rap ATI domains in S. hygroscopicus). 

Ways to identify polyketide synthases, their domains and the functional similarity of 
domains are well-known to those of ordinary skill in the art. For example, the PKS region of 
the chromosome of a polyketide or putative polyketide-producing microorganism may be 

25 identified by hybridizing with nucleic acid probes under conditions of low or high stringency. 
Hybridization under high stringency conditions is generally performed in a buffer consisting 
of 15 mM sodium chloride and 1.5 mM trisodium citrate (0.1 x SSC) with an incubation 
temperature of about 65°C (see for example, Maniatis, et aL supra). To detect more distantly 
related PKS genes, hybridization is performed under low stringency conditions which include 

30 lower temperature incubations and/or the presence of increased amounts of sodium chloride 
and trisodium citrate (Maniatis, et al supra). Once identified, the chromosomal region may 
be isolated, cloned into a suitable vector and sequenced, using conventional methods or 
commercial sequencing kits such as Sequenase (US Biochemical Corp, Cleveland, OH). 
Methods for isolating and cloning chromosomal DNA are also well known in the art 

35 (Maniatis, et al supra). An amino acid sequence may then be deduced from the DNA 

sequence and a comparison made of the unknown amino acid sequence to that of one or more 
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polypeptides involved in polyketide biosynthesis. Two amino acid sequences showing at 
least about 20% and more preferably about 25% identity and having conserved active site 
residues or motifs are considered to specify functionally similar or equivalent PKS domains. 
Having identified such domains, the number and composition of modules as well as the 
5 arrangement of modules within particular ORFs can be determined. 

In the case where the newly defined PKS produces a polyketide of known structure, 
the B-carbonyl processing and types of side chain moieties and their positioning on the 
polyketide backbone can be correlated to specific domains within modules. Because modules 
are established linearly within ORPs, this correlation also allows one to determine the order 
10 of modular activity (i.e. which module catalyzes which condensation step) in the PKS. For 
example, the B-carbonyl processing and types of side chain moieties in the polyketide 
generates a pattern of chemical groups that can be correlated to a pattern of domains within 
an ORF. Based on the specific type of side chain moiety at a given carbon, one can then 
predict the particular substrate utilized by that module's AT domain. 
15 In the case where the polyketide structure is unknown, theoretically, comparative 

sequence analysis alone may be used to predict the substrate specificity of an AT domain. To 
accomplish this, at least two and preferably, three or more sequences known or predicted to 
specify a particular substrate can be compared to determine one or more conserved or 
consensus motifs unique to that family of ATs. An unknovm AT having such motifs can then 
20 be assigned to a particular family. 

Altematively, comparative analyses can be performed using computer programs 
which group AT domains based on primary amino acid sequence similarity or phylogenetic 
relationships. For example, comparative analyses were made of the amino acid sequences of 
the AT domains in DEBS with corresponding AT domains in the PKS for rapamycin to 
25 determine whether the extender imit used by a particular AT domain, (either malonate or 
methylmalonate), correlated with the degree of sequence identity between these domains. 
Rapamycin is a l2u:ge polyketide that is assembled through 14 condensation events; the 
rapamycin PKS possesses 14 AT domains whose sequences were deduced from knovra 
nucleotide sequences (Aparicio et al. Gene 169:9-16 (1996)). Amino acid sequence 
30 comparisons of the 14 AT domains of the rapamycin PKS with each other and with the 6 AT 
domains from DEBS, showed that the AT domains fell into two distinct groupings in which 
the rapamycin AT domains from modules 1, 3, 4, 6, 7, 10 and 13 clustered with the 6 
erythromycin AT domains and the rapamycin AT domains in modules 2, 5, 8, 9, 1 1, 12 and 
14 formed a separate cluster (Haydock et al. FEBS Letts. 374:246-248 (1995)). Examination 
35 of the polyketide structure of rapamycin indicated that methyl side chains were at positions 
on the lactone ring corresponding to condensation steps 1, 3, 4, 6, 7, 10 and 13, which 
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suggested that methylmalonate was used as the extender unit during synthesis of these 
sections of the acyl chain; protons at the positions of the lactone ring corresponding to 
condensations steps 2, 5, 8, 9, 1 1, 12 and 14 suggested that malonate was utilized as the 
extender unit during synthesis of these sections. Two additional AT domains described 
5 herein, ligAT2 and venAT, were also found to cluster with the putative malonate AT domains 
from the rapamycin PKS (FIG. 3). Having predicted that AT domains from rap modules 2, 5, 
8, 9, 1 1 , 12 or 14, as well as ligAT2 and venAT, specify malonate as extender units, the DNA 
encoding such domains could be isolated, cloned and used to replace the DNA encoding one 
or more AT domains in a PKS such as DEBS, in order to generate novel polyketides. 
10 The techniques for determining the amino acid sequence " similarity" are well-known 

in the art. In general, when two or more polypeptides are aligned with one another, their 
sequence similarity refers to the amino acids at corresponding positions within each 
polypeptide sequence that are identical or possess similar chemical and/or physical properties 
such as charge or hydrophobicity. A so-termed "percent similarity" then can be determined 
15 between the compared polypeptide sequences. In general, the term " identity" refers to an 

exact nucleotide to nucleotide or amino acid to amino acid correspondence at a given position 
of two polynucleotides or polypeptide sequences, respectively. Two amino acid sequences 
(or for that matter, two or more polynucleotide sequences) can be compared by determining 
their "percent identity." The programs available in the Wisconsin Sequence Analysis 
20 Package, Version 8 (available from Genetics Computer Group (GCG), Madison, WI), for 
example, the GAP program, are capable of calculating both the identity between two 
polynucleotides and the identity and similarity between two polypeptide sequences, 
respectively. Other programs for calculating and displaying similarity between sequences are 
known in the art. For example, the Growtree program (GCG, Madison, WI) creates a 
25 phylogenetic tree wherein the most closely related sequences are clustered and joined by the 
shortest lines. This tree is derived from a matrix created by the program Distances (GCG, 
Madison, WI) which calculates pairwise relationships within a group of aligned sequences. 

In a preferred embodiment, novel polyketide molecules of desired structure are 
produced by the replacement of at least one AT domain-encoding fragment of DNA of the 
30 Sac. erythraea chromosome with at least one heterologous AT domedn-encoding fragment of 
DNA from another PKS cluster to yield novel polyketide compounds which are derivatives of 
6-deoxyerythronolide B, erythronolide B, 3-a-L-mycarosylerythronolide B, or erythromycins 
A, B, C and D. Such derivatives are compounds wherein methyl (-Me) side chains at one or 
more positions of the macrocylic lactone ring are replaced by substituents independently 
35 selected from the group consisting of (a) -H; (b) ethyl group (-Et); (c) hydroxyl group (-OH) 
and (d) allyl group (-A1). In a particularly preferred embodiment, a method is provided for 
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the genetic modification of erythromycin-producing microorganisms which enables them to 
produce the novel compounds 12-desmethyl-12-deoxyerythromycin A, 10- 
desmethylerythromycin A, 10-desmethyM2-deoxyerythromycin A^, or 6-desmethyl-6- 
ethylerythromycin A. The compounds 12-desmethyl-12-deoxyerythromycin A, 10- 
5 desmethylerythromycin A, lO-desmethyl-12-deoxyerythromycin A, and 6-desmethyl-6- 
ethylerythromycin A are represented by the structural formulae: 




12-desmethyl-12-deoxyerythromycin A (I) lO-desmethyierythromycin A (II) 




lO-desmethyl-12-deoxyerythroniycin A (III) 6-desmethyl-6-ethylerythromycin A (IV) 



The general scheme for producing such polyketides is outlined in FIG. 4a and FIG. 4b. In the 
preferred embodiment, heterologous DNA fragments encoding related AT domains are 
introduced into the Sac. erythraea chromosome by a two-step method termed gene 
15 replacement. 

In the first step of gene replacement, an integration vector is constructed through a 
multi-step cloning approach that places a heterologous gene or fragment thereof between two 
segments of DNA having sequences which are identical to those that immediately border (on 
each side) the resident polynucleotide sequence to be replaced. Construction of such a vector 
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may be achieved by any means known to those of ordinary skill in art. For example, 
nucleotide sequences which flank the gene to be replaced can be generated by PGR 
amplification using chromosomal DN A as template and primers which hybridize to the 
chromosomal sequences immediately upstream and downstream of the flanking sequences of 

5 interest. The length of the flanking sequences is not critical to the practice of the invention 

but preferably is about 20-5000 base pairs (bp), more preferably about 100-5000 bp, and even 
more preferably about 500-5000 bp. A most preferred length of flanking sequence is about 
750-1500 bp. Primers used for such amplifications may also comprise convenient restriction 
sites to facilitate cloning of the amplified sequences into suitable preparative vectors, to 

10 facilitate insertion of the heterologous sequence of interest between the flanking sequences 
and/or to facilitate subcloning of the entire group of sequences (5 '-flanking 
region/heterologous polynucleotide sequence of interest/flemking region-3') into suitable 
vectors for integration. The desired heterologous polynucleotide sequences may be generated 
in a like manner. 

1 5 The integration vectors are constructed to also comprise a fragment of DNA 

containing at least one origin of replication that is functional in an intermediate host but is 
non-functional or poorly functional in the production host. The vectors further comprise one 
or more fragments of DNA conferring resistance to an antibiotic, of which at least one 
functions in the intermediate host and at least one functions in the production host. Preferred 

20 integration vectors comprise the ColEl and pIJlOl origins of replication, as found in piasmid 
pCS5 (J. Vara et al,J, Bacteriol 171 :5872-5881 (1989)). A particularly preferred vector 
carries a DNA fragment conferring resistance to thiostrepton £md ampicillin. However, those 
skilled in the art understand that the particular antibiotic resistance genes and origins of 
replication identified above are necessary only inasmuch as they allow for the generation and 

25 selection of the desired recombinant plasmids and host cells. Other markers and origins of 
replication may also be used in the practice of the invention. 

When the resident domains of a PKS are functional components of large 
multifunctional polypeptides, care must be taken in the construction of the integration 
piasmid so that the heterologous DNA fragment encoding the heterologous AT domain is 

30 positioned in the correct orientation and reading frame to its flanking DNA segments so that 
upon trsmslation from the beginning of the coding sequence, an enzymatically functional 
protein is produced. The correct positioning becomes immediately apparent from knowledge 
of the nucleotide sequences of the host PKS genes and the heterologous genes used for gene 
replacement. 

35 In the second step, each of the integration vectors carrying a related gene or fragment 

thereof is independently introduced into a host strain and recombination between each of the 
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genomic fragments in the integration plasmid and its corresponding homologous fragment in 
the host strain chromosome is allowed to occur. This procedure results in the exchange of the 
resident AT-encoding DN A in the chromosome for its heterologous counterpart. The general 
scheme for gene replacement by homologous recombination is outlined in FIG. 5. 
5 Procedures to introduce DNA into polyketide-producing microorganisms and to facilitate 
homologous recombination are described herein. However, those skilled in the art 
imderstand that alternative procedures for introducing DNA into a polyketide-producing 
microorganism, such as electroporation, transduction, or conjugation, are well known and 
may also be used in the practice of the invention. Procedures for cultivating polyketide- 
10 producing microorganisms, as well as methods to recover novel polyketides produced from 
modified strains, to purify such compounds and to confirm the identity of those compounds 
(such as by mass spectrometry or NMR) are well-known to those of ordinary skill in the art. 

Although the present invention is described in the Examples that follow in terms of 
preferred embodiments, they are not to be regarded as limiting the scope of the invention. 
15 The descriptions that follow serve to illustrate the principles and methodologies involved in 
creating novel derivatives of erythromycin. Whereas the examples below describe the 
replacement of the Sac, erythraea ATI, AT2, and AT4-encoding DNA fragments with a 
heterologous DNA fragment which encodes either an AT domain that specifies incorporation 
of malonate (malonate-AT) or an AT domain that specifies incorporation of ethylmalonate 
20 (ethylmalonate-AT), those skilled in the art understand that one or more fragments of 
heterologous DNA encoding malonate, ethylmalonate, allylmalonate, and/or 
hydroxymalonate (tartronate)-AT domains can be used to replace the other AT-encoding 
DNA fragments of the erythromycin PKS in Sac. erythraea to result in the production of 
other novel erythromycin derivatives. For example, novel erythromycins produced when 
25 resident AT-encoding DNA fragments in the erythromycin PKS (eryPKS) are independently 
replaced with heterologous DNA fragments specifying malonate and/or ethylmalonate as the 
extender unit are shown in Table 1 . 

In particular, those skilled in the art understand that following the methods described 
herein for replacement of a single resident AT-encoding DNA fragment in the eryPKS, 
30 replacements of two resident AT-encoding DNA fragments with heterologous DNA 

fragments (specifying malonate, ethylmalonate, allylmalonate, and/or hydroxymalonate -AT 
domains) in stepwise fashion are also possible and result in the formation of novel 
disubstituted erythromycins. Similarly, trisubstituted erythromycins, tetrasubstituted 
erythromycins, pentasubstituted erythromycins and hexasubstituted erythromycins can also 
35 be made by replacement of three, four, five and six resident AT-encoding DNA firagments in 
the eryPKS, respectively, with heterologous AT-encoding DNA fragments as described 
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herein. Therefore, all substitutions of AT-encoding DNA fragments in the eryPKS with 
heterologous AT-encoding DNA fragments (yielding all varieties of proton, ethyl, allyl, and 
hydroxyl substituted erythromycin derivatives) are within the scope of the present invention. 
Examples of compounds produced by such replacements include but are not limited to those 
5 shown in Table 1 below. 

Table 1 

Structures from Changes at Side Chain Positions 




A. Single Changes 

R2 R2 Rf Ri Name 

H Me Me Me" Me* Me 

15 Et Me Me Me Me Me 

Me H Me Me Me Me 

Me Et Me Me Me Me 

Me Me H Me Me Me 

Me Me Et Me Me Me 

20 Me Me Me H Me Me 

Me Me Me Et Me Me 

Me Me Me Me H Me 

Me Me Me Me Et Me 

Me Me Me Me Me H 

25 Me Me Me Me Me Et 

B. Two Changes 
H Me Me Me Me Et 
H Me Me Me Et Me 

30 H Me Me Et Me Me 

H Me Et Me Me Me 

H Et Me Me ME Me 

H Me Me Me Me H 

H Me Me Me H Me 

35 H Me Me H Me Me 

H Me H Me Me Me 



12-Desmethylerythromycin A 
12-Desmethyl-12-ethylerythromycin A 
lO-Desmethylerythromycin A 
10-Desmethyl-lO-ethylerythromycin A 
8-Desmethylerythromycin A 
8-Desmethyl-8-ethyleiythromycin A 
6-Desmethylerythromycin A 
6-DesmethyI-6-ethylerythromycin A 
4-Desmethylerythromycin A 
4-Desmethyl-4-ethylerythromycin A 
2-Desmethylerythromycin A 
2-Desmethyl-2-ethylerythromycin A 



2,12-Didesmethyl-2-ethylerythromycin A 
4,12-DidesmethyI-4-ethylerythromycin A 
6,1 2-Didesmethyl-6-ethylerythromycin A 
8,12-Didesmethyl-8-ethylerythromycin A 
1 0, 1 2-Didesmethy 1- 1 0-ethy lerythromycin A 
2,12-Didesmethylerythromycin A 
4, 1 2-Didesmethy lerythromycin A 
6, 12-Didesmethy lerythromycin A 
8,1 2-Didesmethy lerythromycin A 



RNJVmCID: cWO 



G851695A2 I > 



wo 98/51695 



PCT/US98/09518 



23 





H 


H 


Me 


Me 


Me 


Me 




Me 


H 


Me 


Me 


Me 


Et 




Me 


H 


Me 


Me 


Et 


Me 




Me 


H 


Me 


Et 


Me 


Me 


5 


Me 


H 


Et 


Me 


Me 


Me 




Me 


H 


Me 


Me 


Me 


H 




Me 


H 


Me 


Me 


H 


Me 




Me 


H 


Me 


H 


Me 


Me 




Me 


H 


H 


Me 


Me 


Me 


10 


Me 


Me 


H 


Me 


Me 


Et 




Me 


Me 


H 


Me 


Et 


Me 




Me 


Me 


H 


Et 


Me 


Me 




Me 


Me 


H 


Me 


Me 


H 




Me 


Me 


H 


Me 


H 


Me 


15 


Me 


Me 


H 


H 


Me 


Me 




Me 


Me 


Me 


H 


Me 


Et 




Me 


Me 


Me 


H 


Et 


Me 




Me 


Me 


Me 


H 


Me 


H 




Me 


Me 


Me 


H 


H 


Me 


20 


Me 


Me 


Me 


Me 


H 


Et 




Me 


Me 


Me 


Me 


H 


H 




Et 


Me 


Me 


Me 


Me 


Et 




Et 


Me 


Me 


Me 


Et 


Me 




Et 


Me 


Me 


Et 


Me 


Me 


25 


Et 


Me 


Et 


Me 


Me 


Me 




Et 


Et 


Me 


Me 


Me 


Me 




Et 


Me 


Me 


Me 


Me 


H 




Et 


Me 


Me 


Me 


H 


Me 




Et 


Me 


Me 


H 


Me 


Me 


30 


Et 


Me 


H 


Me 


Me 


Me 




Et 


H 


Me 


Me 


Me 


Me 




Me 


Et 


Me 


Me 


Me 


Et 




Me 


Et 


Me 


Me 


Et 


Me 




Me 


Et 


Me 


Et 


Me 


Me 


35 


Me 


Et 


Et 


Me 


Me 


Me 




Me 


Et 


Me 


Me 


Me 


H 




Me 


Et 


Me 


Me 


H 


Me 




Me 


Et 


Me 


H 


Me 


Me 




Me 


Et 


H 


Me 


Me 


Me 


40 


Me 


Me 


Et 


Me 


Me 


Et 




Me 


Me 


Et 


Me 


Et 


Me 




Me 


Me 


Et 


Et 


Me 


Me 




Me 


Me 


Et 


Me 


Me 


H 




Me 


Me 


Et 


Me 


H 


Me 


45 


Me 


Me 


Et 


H 


Me 


Me 




Me 


Me 


Me 


Et 


Me 


Et 




Me 


Me 


Me 


Et 


Et 


Me 




Me 


Me 


Me 


Et 


Me 


H 




Me 


Me 


Me 


Et 


H 


Me 


50 


Me 


Me 


Me 


Me 


Et 


Et 




Me 


Me 


Me 


Me 


Et 


H 
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Three Changes 








H 


H 


Me 


Me 


Me 


Et 



10,12-DidesmethyleTythromycin A 
2, 1 0-Didesmethyl-2-ethy lerythromycin A 
4,1 0-Didesmethyl-4-ethylerythromycin A 
6,10-Didesmethyl-6-ethylerythromycin A 
8,1 0-DidesmethyI-8-ethylery thromycin A 
2,10-Didesmethylerythromycin A 
4,10-Didesmethylerythromycin A 
6, 1 0-Didesmethylerythromycin A 
8,10-Didesmethylerythromycin A 
2,8-Didesmethyl-2-ethylerythromycin A 
4,8-Didesmethyl-4-ethylerythromycin A 
6,8-Didesmethyl-6-ethylerythromycin A 
2,8-Didesmethylerythromycin A 
4,8-Didesmethylerythromyciii A 
6,8-Didesmethylerythromycin A 
2,6-Didesmethyl-2-ethylerythromycin A 
4,6-Didesinethyl-4-ethylerythromycin A 
2,6-Didesmethylerythromycin A 
4,6-Didesmethylerythromycin A 
2,4,-Didesmethyl-2-ethylerythromycin A 
2,4,-Didesmethylerythromycin A 
2,12-Didesmethyl-2,12-diethylerythromycin A 
4,1 2-Didesmethyl-4, 1 2-diethylerythromycin A 
6, 1 2-Didesmethyl-6, 1 2-diethyIerythromycin A 
8,1 2-Didesmethyl-8, 1 2-diethylerythromycin A 
1 0, 1 2-Didesmethy 1- 1 0, 1 2-diethylerythromycin A 
2, 1 2"Didesmethyl- 1 2-ethy lerythromycin A 
4, 1 2-Didesmethy 1- 1 2-ethy lerythromycin A 
6, 1 2-Didesmethyl- 1 2-ethyleiythromycin A 
8, 1 2-Didesmethyl- 1 2-ethylery thromycin A 
1 0, 1 2-Didesmethyl- 1 2-ethy lerythromycin A 
2, 1 0-Didesmethyl-2,1 0-diethy lerythromycin A 
4, 1 0-Didesmethy 1-4, 1 0-diethy lerythromycin A 
6, 1 0-Didesmethyl-6, 1 0-diethy lerythromycin A 
8, 1 0-Didesmethyl-8, 1 0-diethylerythromycin A 
2, 1 0-Didesmethy 1- 1 0-ethylery thromycin A 
4, 1 0-Didesmethyl- 1 0-ethylerythromycin A 
6,1 0-Didesmethy 1-10-ethylerythromycin A 
8, 1 0-Didesmethyl- 1 0-ethylerythromycin A 
2,8-Didesmethyl-2,8-diethylerythromycin A 
4,8-Didesmethyl-4,8-diethylerythromycin A 
6, 8-Didesmethy 1-6, 8-diethy lerythromycin A 
2,8-Didesmethyl-8-ethylerythromycin A 
4,8-Didesmethyl-8-ethylerythromycin 
6,8-Didesmethyl-8-ethylerythromycin 
2,6-Didesmethyl-2,6-diethylerythromycin A 
4,6-Didesmethyl-4,6-diethyler5^thromycin A 
2,6-Didesmethyl-6-ethylerythromycin A 
4,6-Didesmethyl-6-ethylerythromycin 
2,4-Didesmethyl-2,4-diethylerythromycin A 
2,4-Didesmethyl-4-ethylerythromycin A 



2, 1 0, 1 2-Tridesmethyl-2-Ethy lerythromycin A 
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2,10,12-Tridesmethylerythromycin A 

4, 1 0, 1 2-Tridesmethyl-4-Ethylerythromycin A 

4,10,12-Tridesmethylerythromycin A 

6, 1 0, 1 2-Tridesmethy 1-6-Ethy lerythromycin A 

6,10,1 2-Tridesmethylerythromycin A 

8, 1 0, 1 2-Tridesmethy 1-8-ethy lerythromycin A 

8, 1 0, 1 2-Tridesmethy lery thromycin A 

2,10,1 2-Tridesmethyl-2, 1 2,-diethylerythromycin A 

2, 1 0, 1 2-Tridesmethy 1- 1 2-ethy lerythromycin A 

4, 1 0, 1 2-Tridesmethy 1-4, 1 2-diethy lerythromycin A 

4, 1 0, 1 2-Tridesmethy 1- 1 2-ethylerythromycin A 

6, 1 0, 1 2-Tridesmethy 1-6, 1 2-diethylerythromycin A 

6,10,1 2-Tridesmethy 1- 1 2-ethy lery thromycin A 

8,1 0, 1 2-Tridesmethyl-8, 1 2-diethylerythromycin A 

8,10,1 2_Tridesmethyl- 1 2-ethylerythromy cin A 

2, 1 0, 1 2-Tridesmethyl-2, 1 0-diethylerythromycin A 

2,1 0,1 2-Tridesmethy 1-1 0-ethylerythromycin A 

4,10,1 2-Tridesmethy 1-4, 1 0-diethylerythromycin A 

4,10,1 2-Tridesmethy 1- 1 0-ethylerythromycin A 

6,1 0,1 2-Tridesmethy 1-6,1 0-diethylerythromycin A 

6,1 0,1 2-Tridesmethy 1-1 0-ethylerythromycin A 

8,10,1 2-Tridesmethy 1-8, 1 0-diethylerythromycin A 

8,10,1 2-Tridesmethy 1- 1 0-ethylerythromycin A 

2, 1 0, 1 2-Tridesmethy 1-2, 1 0, 1 2-triethy lerythromycin A 

2, 1 0, 1 2-Tridesmethy 1- 1 0, 1 2-diethylerythromycin A 

4, 1 0, 1 2-Tridesmethy 1-4, 1 0, 1 2-triethy lerythromycin A 

4,10,1 2-Tridesmethy 1- 1 0, 1 2,-diethy lerytiiromycin A 

6, 1 0, 1 2-Tridesmethy 1-6, 1 0, 1 2-triethylerythromycm A 

6,10,1 2-Tridesmethy 1- 1 0, 1 2-diethylerythromycin A 

8 , 1 0, 1 2-Tridesmethy 1-8 , 1 0, 1 2-triethy lery thromycin A 

8,10,1 2-Tridesmethyl- 1 0, 1 2-diethylerythromycin A 

2,8,1 2-Tridesmethyl-2-ethy lerythromycin A 

2,8,1 2-Tridesmethy lerythromycin A 

4,8,1 2-Tridesmethy 1-4- ethy lery thromycin A 

4,8,1 2-Tridesmethy lerythromycin A 

6,8,1 2-Tridesmethy 1-6- ethy lerythromycin A 

6,8,1 2-Tridesmethylerythromycin A 

2,8, 1 2-Tridesmethy 1-2, 1 2-diethylerythromycin A 

2,8, 1 2-Tridesmethyl- 1 2-ethylerythromycin A 

4,8, 1 2-Tridesmethyl-4, 1 2-diethylerythromycin A 

4,8,1 2-Tridesmethyl- 1 2-ethylerythromycin A 

6,8, 1 2-Tridesmethy 1-6, 1 2-diethylerythromycin A 

6,8, 1 2-Tridesmethyl- 1 2-ethylerythromycin A 

2,8, 1 2-Tridesmethyl-2,8-diethylerythromycin A 

2,8, 1 2-Tridesmethy 1-8-ethy lerythromycin A 

4,8, 1 2-Tridesmethy 1-4,8-diethylerythromycin A 

4,8,1 2-Tridesmethyl-8-ethylerythromycin A 

6,8, 1 2-Tridesmethyl-6,8-diethy lerythromycin A 

6,8, 1 2-Tridesmethyl-8-ethy lerythromycin A 

2,8, 1 2-Tridesmethyl-2,8, 1 2-triethylerythromycin A 

2,8 , 1 2-Tridesmethy 1-8, 1 2-diethylerythromycin A 

4,8, 1 2-Tridesmethy 1-4,8, 1 2-triethylerythromycin A 

4,8, 1 2-Tridesmethyl-8, 1 2-diethylerythromycin A 

6,8, 1 2-Tridesmethyl-6,8, 1 2-triethylerythromycin A 
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Et Me Et H Me Me 6,8,12.Tridesmethyl-8,12-diethylerythromycin A 

H Me Me H Me Et 2,6,12-Tridesmethyl-2-ethylerythromycin A 

H Me Me H Me H 2,6,12-Tridesmethylerythromycin A 

H Me Me H Et Me 4,6,12-TridesmethyM-ethylerythromycin A 

5 H Me Me H H Me 4,6,12-Tridesmethylerythromycin A 

Et Me Me H Me Et 2,6,12-Tridesmethyl-2,12-diethylerythromycin A 

Et Me Me H Me H 2,6,12-Tridesmethyl-12-ethylerythromycin A 

Et Me Me H Et Me 4,6,12-Tridesmethyl-4,12« diethylerythromycin A 

Et Me Me H H Me 4,6,12-Tridesmethyl-12-ethyierythroniycin A 

10 H Me Me Et Me Et 2,6,12-Tridesmethyl-2,6-diethylerythromycin A 

H Me Me Et Me H 2,6,12-Tridesniethyl-6,-ethylerythromycin A 

H Me Me Et Et Me 4,6,12-Tridesniethyl-4,6-diethylerythromycin A 

H Me Me Et H Me 4,6,12-Tridesmethyl-6-ethylerythromycin A 

Et Me Me Et Me Et 2,6,12-Tridesmethyl-2,6,12-triethylerythromycin A 

15 Et Me Me Et Me H 2,6,12-Tridesmethyl-6,12-diethylerythromycin A 

Et Me Me Et Et Me 4,6,12-Tridesmethyl-4,6,I2-triethylerythromycin A 

Et Me Me Et H Me 4,6,12-Tridesmethyl-6,12-diethylerythromycm A 

H Me Me Me H Et 2,4,12-Tridesmethyl-2-ethylerythromycin A 

H Me Me Me H H 2,4,12-Tridesmethylerythromycin A 

20 Et Me Me Me H Et 2,4,12-Tridesmethyl-2,12-diethylerythromycin A 

Et Me Me Me H H 2A12-Tridesmethyl-12-ethylerythromycin A 

H Me Me Me Et Et 2,4,12-Tridesmethyl-2,4-diethylerythromycin A 

H Me Me Me Et H 2,4,12-Tridesmethyl-4-ethylerythromycin A 

Et Me Me Me Et Et 2,4,12-Tridesmethyl-2,4,12-triethylerythromycin A 

25 Et Me Me Me Et H 2,4,12-Tridesmethyl-2,12-diethylerythromycin A 

Me H H Me Me Et 2,8,10-Tridesmethyl-2-ethylerythromycin A 

Me H H Me Me H 2,8,10-Tridesmethylerythromycin A 

Me H H Me Et Me 4,8,10-Tridesmethyl-4-ethylerythromycin A 

Me H H Me H Me 4,8,10-Tridesmethylerythromycm A 

30 Me H H Et Me Me 6,8,10-Tridesmethyl-6-ethylerythromycin A 
Me H H H Me Me 6,8,10-Tridesmethylerythromycin A 

Me Et H Me Me Et 2,8,10-TridesmethyI-2,10-diethylerythromycin A 

Me Et H Me Me H 2,8,10-Tridesmethyl-lO-ethylerythromycin A 

Me Et H Me Et Me 4,8,10-Tridesmethyl-4J0-diethylerythromycin A 

35 Me Et H Me H Me 4,8,10-TridesmethyHO-ethylerythromycin A 

Me Et H Et Me Me 6,8,10-Tridesmethyl-6,10-diethylerythromycin A 
Me Et H H Me Me 6,8,10-Tridesmethyl-lO-ethylerythromycin A 

Me H Et Me Me Et 2,8,10-Tridesmethyl-2,8-diethylerythroinycin A 

Me H Et Me Me H 2,8,10-Tridesmethyl-8-ethylerythromycin A 

40 Me H Et Me Et Me 4,8,10-Tridesmethyl-4,8-diethylerythromycin A 

Me H Et Me H Me 4,8,10-Tridesmethyl-8-ethylerythromycin A 

Me H Et Et Me Me 6,8,10-Tridesmethyl-6,8-diethylerythromycin A 
Me H Et H Me Me 6,8,10-Tridesinethyl-8-ethylerythromycin A 

Me Et Et Me Me Et 2,8,10-Tridesmethyl-2,8,10-triethylerythromycin A 

45 Me Et Et Me Me H 2,8,10-Tridesmethyl-8,10-diethylerythromycin A 

Me Et Et Me Et Me 4,8,10-Tridesmethyl-4,8,10-triethylerythromycin A 

Me Et Et Me H Me 4,8,10-Tridesinethyl-8-10-diethylerythromycin A 

Me Et Et Et Me Me 6,8,10-Tridesmethyl-6,8,10-triethylerythromycin A 
Me Et Et H Me Me 6,8,10-Tridesmethyl-8,10-diethylerythromycin A 
50 Me H Me H Me Et 2,6,10-Tridesmethyl-2-ethyierytiiromycin A 
Me H Me H Me H 2,6,10-Tridesmethylerythromycin A 
Me H Me H Et Me 4,6,10-Tridesmethyl-4-ethylerythromycin A 
Me H Me H H Me 4,6,10-Tridesmethylerythromycin A 
Me Et Me H Me Et 2,6,10-Tridesmethyl-2,l-diethylerythromycin A 
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D. Four Changes 
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Me 


Me 


Et 



2,6, 1 0-Tridesmethyl- 1 0-ethy lerythromy cin A 
4,6, 1 0-Tridesmethy 1-4, 1 0-diethylerythromycin A 
4,6,1 0-Tridesmethyl- 1 0-ethylerythromycin A 
2,6, 1 0-Tridesmethy 1-2,6-diethylerythromycin A 
2,6, 1 0-Tridesmethyl-6-ethy lerythromycin A 
4,6, 1 0-Tridesmethy l-4,6,diethylerythromycin A 
4,6, 1 0-Tridesmethy 1-6-ethylerythromycin A 
2,6, 1 0-Tridesmethyl-2,6, 1 0-triethylerythromy cin A 
2,6,1 0-Tridesmethy 1-6, 1 0-diethylerythromycin A 
4,6, 1 0-Tridesmethy 1-4,6, 1 0-triethylerythromycin A 
4,6, 1 0-Tridesmethyl-6, 1 0-diethylerythromycin A 
2,4, 1 0-Tridesmethyl-2-ethy lerythromycin A 
2,4,10-Tridesmethylerythromycin A 
2,4, 1 0-Tridesmethyl-2, 1 0-diethy lerythromycin A 
2,4, 1 0-Tridesmethyl- 1 0-ethylerythromycin A 
2,4, 1 0-Tridesmethy 1-2 ,4-diethy lerythromycin A 
2,4,1 0-Tridesmethy 1-4-ethy lerythromycin A 
2,4, 1 0-Tridesmethyl-2,4, 1 0-triethylerythromycin A 
2,4, 1 0-Tridesmethy 1-4, 1 0-diethylerythromycin A 
2,6,8-Tridesmethyl-2-ethylerythromycin A 
2,6,8-Tridesmethylerythromycin A 
4,6,8-Tridesmethyl-4-ethylerythromycin A 
4,6,8-Tridesmethylerythromycin A 
2,6,8-Tridesmethyl-2,8-diethylerythromycin A 
2,6,8-Tridesmethyl-8-ethylerythromycin A 
4,6,8-Tridesmethyl-4,8-diethylerythromycin A 
4,6,8-Tridesmethyl-8-ethylerythromycin A 
2,6,8-Tridesmethyl-2,6-diethylerythromycin A 
2,6,8-TridesmethyI-6-ethylerythromycin A 
4,6,8-Tridesmethyl-4,6-diethylerythromycin A 
4,6,8-Tridesmethyl-6-ethylerythromycin A 
2,6,8-Tridesmethyl-2,6,8-triethylerythromycin A 
2,6,8-Tridesmethyl-6,8-diethylerythromycin A 
4,6,8-Tridesmethyl-4,6,8-triethylerythromycin A 
4,6,8-Tridesmethyl-6,8-triethylerythromycin A 
2,4,8- Tridesmethyl-2-ethylerythromycin A 
2,4,8-Tridesmethylerythromycin A 
2,4,8-Tridesmethyl-2,8-diethylerythromycin A 
2,4,8-Tridesmethyl-8-ethylerythromycin A 
2,4,8-Tridesmethyl-2,4-diiethylerythromycin A 
2,4,8-Tridesmethyl-4-ethylerythromycin A 
2,4,8-Tridesmethyl-2,4,8-triethylerythromycin A 
2,4,8-Tridesmethyl-4,8-diethylerythromycin A 
2,4,6-Tridesmethyl-2-ethylerythromycin A 
2,4,6-Tridesmethylerythromycin A 
2,4,6-Tridesmethyl-2,6-diethylerythromycin A 
2,4,6-Tridesmethyl-6-ethyl erythromycin A 
2,4,6-Tridesmethyl-2,4-diethyl erythromycin A 
2,4,6-Tridesmethyl-4-ethyl erythromycin A 
2,4,6-TridesmethyI-2,4,6-triethyl erythromycin A 
2,4,6-Tridesmethyl-4,6-diethyl erythromycin A 



2,8,10,12-Tetradesmethyl-2-ethylerythromycin A 
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2,8,10,12-Tetradesmethyl-2,8-diethylerythromycin A 
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2,8, 1 0, 1 2-Tetradesmethyl-8-ethylerythromycin A 
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2,6, 1 0, 1 2-Tetradesmethyl-2,8, 1 0-triethylerythromycin A 
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2,6, 1 0, 1 2-Tetradesmethyl-8, 1 0-diethylerythromycin A 
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4,8,10,1 2-Tetradesmethy 1-4,8, 1 0-triethylerythromycin A 
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4,8,10,12-Tetradesmethyl-8,1 0-diethyler5^hromycin A 
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6,8, 1 0, 1 2-Tetradesmethyl-6,8, 1 0-triethylerythromycin A 
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6,8,10,1 2-Tetradesmethyl-8, 1 0-diethylerythromycin A 
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2,8, 1 0, 1 2-Tetradesmethyl-2, 1 2-diethylerythromycin A 
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2,8, 1 0, 1 2-Tetradesmethyl- 1 2-ethylerythromycin A 
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4,8, 1 0, 1 2-Tetradesmethyl-4, 1 2-diethylerythromycin A 
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4,8, 1 0, 1 2-Tetradesmethyl- 1 2-ethylerythromycin A 
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6,8, 1 0, 1 2-Tetradesmethyl-6, 1 2-diethylerythromycin A 
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6,8,10,1 2-Tetradesmethyl- 1 2-ethylerythromycin A 
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2,6, 10,1 2-Tetradesmethyl-2, 1 0, 1 2-triethylerythromycin A 
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2,6, 1 0, 1 2-Tetradesmethyl- 1 0, 1 2-diethylerythromycin A 
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6,8, 1 0, 1 2-Tetradesmethyl-6, 1 0, 1 2-triethylerythromycin A 


35 


Et 


Et 


H 


H 


Me 


Me 


6,8, 1 0, 1 2-Tetradesmethyl- 1 0, 1 2-diethylerythromycin A 
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2,6, 1 0, 1 2-Tetradesmethyl- 1 0-ethylerythromycin A 
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4,6, 10,1 2-Tetradesmethyl-4, 1 0-diethylerythromycin A 
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4,6, 10,1 2-Tetradesmethyl-4,6, 1 0-triethylerythromycin A 
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4,6, 10,1 2-Tetradesmethy 1-6, 1 0-diethy lerythromycin A 
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r y \^ J A ^ K A VA IbAVA WkJA AX^/ vA * T <^ ^ A WA T VA A A Vp' A A AT WA a AAA 




Et 


Et 


Me 


H 


Me 


Et 
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2,6,8,1 2-Tetradesmethyl-6,8-diethylerythromycin A 
4,6,8, 1 2-Tetradesmethyl-4,6,8-triethylerythromycin A 
4,6,8, 12-Tetradesmethyl-6,8-dietiiylerythromycin A 
2,6,8, 1 2-Tetradesmethyl-2, 1 2-diethy lerythromycin A 
2,6,8, 1 2-Tetradesmethyl- 1 2-ethylerythromycin A 
4,6,8, 1 2-Tetradesmethyl-4, 1 2-diethy lerythromycin A 
4,6,8 , 1 2-Tetradesmethyl- 1 2-ethylerythromycin A 
2,6,8,12-Tetradesmethyl-2,8,12-triethylerythromycin A 
2,6,8,1 2-Tetradesmethy 1-8,1 2-diethy lerythromycin A 
4,6,8, 1 2-Tetradesmethy 1-4,8, 1 2-triethylerythromycin A 
4,6,8, 1 2-Tetradesmethyl-8, 1 2-diethylerythromycin A 
2,6,8, 1 2-Tetradesmethy 1-2,6, 1 2-triethy lerythromycin A 
2,6,8, 1 2-Tetradesmethyl-6, 1 2-diethylerythromycin A 
4,6,8, 1 2-Tetradesmethyl-4,6, 1 2-triethy lerythromycin A 
4,6,8, 1 2-Tetradesmethyl-6, 1 2-diethylerythromycin A 
2,6,8, 1 2-Tetradesmethyl-2,6,8, 1 2-tetraethy lerythromycin A 
2,6,8, 1 2-Tetradesmethy 1-6,8, 1 2-triethy lerythromycin A 
4,6,8,1 2-Tetradesmethyl-4,6,8, 1 2-tetraethylerythromycin A 
4,6,8, 1 2-Tetradesmethy 1-6,8, 1 2-triethylerythromycin A 
2,4,6, 1 2-Tetradesmethyl-2-ethy lerytiiromycin A 
2,4,6,1 2-Tetradesmethy lerythromycin A 
2,4,6, 1 2-Tetradesmethyl-2,6-diethy lerythromycin A 
2,4,6, 1 2-Tetradesmethy 1-6-ethylerythromycin A 
2,4,6, 1 2-Tetradesmethy 1-2,4-diethy lerythromycin A 
2,4,6, 12-Tetradesmethyl-4-ethy lerythromycin A 
2,4,6, 1 2-Tetradesmethy 1-2,4,6-triethy lerythromycin A 
2,4,6, 1 2-Tetradesmethy 1-4,6-diethy lerythromycin A 
2,4,6, 1 2-Tetradesmethy 1-2, 1 2-diethylerythromycin A 
2,4,6, 1 2-Tetradesmethyl- 1 2-ethyierythromycin A 
2,4,6, 1 2-Tetradesmethyl-2,6, 1 2-triethylerythromycin A 
2,4,6, 1 2-Tetradesmethyl-6, 1 2-diethylerythromycin A 
2,4,6, 1 2-Tetradesmethyl-2,4, 1 2-triethylerythromycin A 
2,4,6, 12-Tetradesmethyl-diethy lerythromycin A 
2,4,6, 1 2-Tetradesmethyl-2,4,6, 1 2-tetraethylerythromycin A 
2,4,6, 1 2-Tetradesmethy 1-4,6, 1 2-triethylerythromycin A 
2,6,8, 1 0-Tetradesmethy 1-2-ethylerythromycin A 
2,6,8, 1 0-Tetradesmethylery thromycin A 
4,6,8, 1 0-Tetradesmethy 1-4-ethy lerythromycin A 
4,6,8, 10-Tetradesmethylerythromycin A 
2,6,8, 1 0-Tetradesmethyl-2,8-diethylerythromycin A 
2,6,8,1 0-Tetradesmethyl-8-ethylerythromycin A 
4,6,8, 1 0-TetradesmethyM,8-diethylerythromycin A 
4,6,8, 1 0-Tetradesmethyl-8-ethylerythromycin A 
2,6,8, 1 0-Tetradesmethy 1-2,6-diethylerythromycin A 
2,6,8,1 0-Tetradesmethy 1-6-ethylerythromycin A 
4,6,8,1 0-Tetradesmethyl-4,6-diethylerythromycin A 
4,6,8,1 0-Tetradesmethyl-6-ethy lerythromycin A 
2,6,8,1 0-Tetradesmethyl-2,6,8-triethylerythromycin A 
2,6,8, 1 0-Tetradesmethyl-6,8-diethylerythromycin A 
4,6,8,1 0-Tetradesmethyl-4,6,8-triethylerythromycin A 
4,6,8, 1 0-Tetradesmethy 1-6,8-diethylerythromycin A 
2,6,8,10-Tetradesmethyl-2,10-diethylerythromycin A 
2,6,8, 1 0-Tetradesmethyl- 1 0-ethylerythromycin A 
4,6,8, 1 0-Tetradesmethy 1-4, 1 0-diethy lerythromycin A 
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E. Five Changes 









H H H H H Me 4,6,8,10,12-Pentadesmethylerythromycin A 

Et H H H H Me 4,6,8,10,12-Pentadesmethyl-12-ethylerythromycin A 

50 H Et H H H Me 4,6,8, 10,12-Pentadesmethyl-l 0-ethylerythromycin A 

H H Et H H Me 4,6,8,10,12-Pentadesmethyl-8-ethylerythromycin A 

H H H Et H Me 4,6,8,1 0,12-Pentadesmethyl-6-ethylerythromycin A 

H H H H Et Me 4,6,8,10,12-Pentadesmethyl-4-ethyIerythromycin A 

Et Et H H H Me 4,6,8, 10,12-Pentadesmethyl-10,12-diethy lerythromycin A 
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4,6,8, 1 0, 1 2-Pentadesmethy 1-4,6.8, 1 0, 1 2-pentaethy lerythromycin A 




H 


H 


H 


H 


Me 


H 


2,6,8, 1 0, 1 2-Pentadesmethy lerythromycin A 




Et 


H 


H 


H 


Me 


H 


2,6,8, 1 0, 1 2-Pentadesmethy 1- 1 2-ethy lerythromycin A 
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Et 


Et 


H 


Me 


H 


2,6,8, 10,1 2-Pentadesmethy 1-8.1 0-diethylerythromycin A 




H 


Et 


H 


Et 


Me 


H 


2.6,8. 1 0. 1 2-Pentadesmethyl-6, 1 0-diethylerythromycin A 




H 


Et 


H 


H 


Me 


Et 


2,6,8. 1 0, 1 2-Pentadesmethyl-2. 1 0-diethylerythromycin A 




H 


H 


Et 


Et 


Me 


H 


2.6,8, 10,1 2-Pentadesmethy 1-6,8-diethy lerythromycin A 


40 


H 


H 


Et 


H 


Me 


Et 


2,6,8. 1 0, 1 2-Pentadesmethy 1-2,8-diethy lerythromycin A 




H 


H 


H 


Et 


Me 


Et 


2,6,8, 1 0, 1 2-Pentadesmethy 1-2.6-diethylerythromycin A 




Et 


Et 


Et 


H 


Me 


H 


2,6,8, 1 0, 1 2-Pentadesmethyl-8, 1 0, 1 2-triethylerythromycin A 




Et 


Et 


H 


Et 


Me 


H 


2,6,8, 1 0, 1 2-Pentadesmethy 1-6, 1 0. 1 2-triethylerythromycin A 




Et 


Et 


H 


H 


Me 


Et 


2.6,8, 1 0, 1 2-Pentadesmethyl-2, 1 0, 1 2-triethylerythromycin A 


45 


Et 


H 


Et 


Et 


Me 


H 


2.6.8,10,12-Pentadesmethyl-6.8,12-triethyierythromycin A 




Et 


H 


Et 


H 


Me 


Et 


2,6,8,10,12-Pentadesmethyl-2,8,12-triethylerythromycin A 




Et 


H 


H 


Et 


Me 


Et 


2,6,8, 10.12-Pentadesmethyl-2,6,l 2-triethylerythromycin A 




H 


Et 


Et 


Et 


Me 


H 


2.6.8, 1 0, 1 2-Pentadesmethy 1-6,8. 1 0-triethylerythromycin A 




H 


Et 


Et 


H 


Me 


Et 


2,6,8, 1 0, 1 2-Pentadesmethyl-2,8, 1 0-triethylerythromycin A 


50 


H 


Et 


H 


Et 


Me 


Et 


2,6,8, 1 0, 1 2-Pentadesmethyl-2,6, 1 0-triethylerythromycin A 




H 


H 


Et 


Et 


Me 


Et 


2,6,8,10,12-Pentadesmethyl-2,6,8-triethylerythromycin A 




Et 


Et 


Et 


Et 


Me 


H 


2,6,8, 1 0, 1 2-Pentadesmethyl-6,8, 1 0, 1 2-tetraethyIerythromycin A 




Et 


Et 


Et 


H 


Me 


Et 


2,6,8, 1 0, 1 2-Pentadesmethy 1-2,8, 1 0, 1 2-tetraethylerythromycin A 
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Et Et H Et Me Et 2.6.8J0,12-Pentadesmethyl-2.6.10.12-tetraethylerythromycin 

AEt H Et Et Me Et 2.6.8,10.12-Pentadesmethyl-2.6.,8.12-tetraethylervthromycin A 

H Et Et Et Me Et 2.6.8,1 0.12-Pentadesmethvl-2.6.8.10-tetraethylervthromycin A 

Et Et Et Et Me Et 2.6.8.10.12-Pentadesmethyl-2.6..8.10.12-pentaethvlervthromycin A 

5 H H H Me H H 2.4.8.10.12-Pentadesmethylervthromycin A 

Et H H Me H H 2.4.8.10.12-Pentadesmethyl-r2-ethylerythromvcin A 

H Et H Me H H 2,4.8.10,12-Pentadesmethyl-lO-ethylerythromycin A 

H H Et Me H H 2.4.8,10.12-Pentadesmethyl-8-ethylerythromvcin A 

H H H Me Et H 2.4.8.10,12-Pentadesmethyl-4-ethylerythromycin A 

10 H H H Me H Et 2.4,8.10,12-Pentadesmethyl-2-ethylerythromvcin A 

Et Et H Me H H 2.4.8.10.12-Pentadesmethvl.lO,12-diethylervthromvcin A 

Et H Et Me H H 2.4.8,10.12-Pentadesmethyl-8.12-diethylervthromycin A 

Et H H Me Et H 2,4.8,10.12-Pentadesmethyl-4.12-diethylervthromycin A 

Et H H Me H Et 2.4,8,10,12-Pentadesmethyl-2.12-diethylervthromycin A 

15 H Et Et Me H H 2,4.8,10.12-Pentadesmethyl-8.10-diethylervthromycin A 

H Et H Me Et H 2.4.8.10.12-Pentadesmethyi-4.10-diethylerythroniycin A 

H Et H Me H Et 2,4,8,1 0.12-Pentadesmethy 1-2. lO-diethylerythromycin A 

H H Et Me Et H 2,4.8.10.12-Pentadesmethyl-4.8-diethylerythromycin A 

H H Et Me H Et 2,4.8,10.12-Pentadesmethyl-2.8-diethylervthromycin A 

20 H H H Me Et Et 2,4,8,10.12-Pentadesmethyl-2.4-diethyleiythromycin A 

Et Et Et Me H H 2,4.8,10,12-Pentadesmethyl-8.10.12-triethylerythromycin A 

Et Et H Me Et H 2,4,8.10,12-Pentadesmethv!-4.10,12-triethylerythromycin A 

Et Et H Me H Et 2,4.8,10.12-Pentadesmethyl-2.10.12-triethylerythromycin A 

Et H Et Me Et H 2,4,8, 10,12-Pentadesniethy 1-4,8, 12-triethylerythromycin A 

25 Et H Et Me H Et 2,4,8,10,12-Pentadesmethyl-2.8.12-triethylerythromycin A 

Et H H Me Et Et 2,4,8,10,12-Pentadesmethyl-2,4,12-triethylerythromycin A 

H Et Et Me Et H 2,4,8,10,12-Pentadesmethyl-4,8,10-triethylerythromycin A 

H Et Et Me H Et 2,4,8,10.12-Pentadesmethyl-2.8,10-triethylerythromycin A 

H Et H Me Et Et 2,4,8.10,12-Pentadesmethyl-2.4,10-triethylerythromycin A 

30 H H Et Me Et Et 2,4.8,10,I2-Pentadesmethyl-2.4.8-triethyiervthromycin A 

Et El Et Me Et H 2,4.8,10,12-Pentadesmethyl-4.8,10,12-tetraethylervthromvcin A 

Et Et Et Me H Et 2,4.8.10.12-Pentadesmethyl-2.8,10,12-telraethylerythromvcin A 

Et Et H Me Et Et 2.4.8.10.12-Pentadesmethyl-2.4.10,12-tetraethylerythromvcin A 

Et H Et Me Et Et 2.4.8. 10.12-Pentadesmethy 1-2.4.8, 12-tetraethylervthromvcin A 

35 H Et Et Me Et Et 2,4,8.10,12-Pentadesmethyl-2,4.8,10-tetraethylerythromvcin A 

Et Et Et Me Et Et 2,4,8,10,12-PentadesmethyI-2,4,8.10,12-pentaethylerytthromycin A 

H H Me H H H 2,4,6, 10,12-Pentadesmethylerythromycin 

Et H Me H H H 2,4.6,10.12-Pentadesmethyl-12-ethylerythromycin A 

H Et Me H H H 2,4,6,10.12-PentadesmethyMO-ethylerythromycin A 

40 H H Me Et H H 2,4.6,10,12-Pentadesmethvl-6-ethylerythromvcin A 

H H Me H Et H 2.4.6.10,12-PentadesmethyI-4-ethylervthromycin A 

H H Me H H Et 2,4,6,10,12-Pentadesmethyl-2-ethylerythroinycin A 

Et Et Me H H H 2,4.6,10,12-Pentadesmethyl-10,12-diethylervthromvcin A 

Et H Me Et H H 2,4.6,10.12-Pentadesmethyi-6.12-diethylerythromycin A 

45 Et H Me H Et H 2.4.6,10.12-Pentadesinethyl-4.12-diethylerythromycin A 

Et H Me H H Et 2,4.6.10,12-Pentadesmethyl-2.12-diethylerv'thromycin A 

H Et Me Et H H 2,4.6,10,12-Pentadesmethyl-6.10-diethylerythromvcin A 

H Et Me H Et H 2,4.6,10,12-PentadesmethyI-4,10-diethylerythromycin A 

H Et Me H H Et 2,4,6,10,12-Pentadesmethyl-2,10-diethyIervthromycin A 

50 H H Me Et Et H 2,4,6,10,12-PentadesmethyI-4,6-diethylerythromycin A 

H H Me Et H Et 2,4,6,10,12-Pentadesmethyl-2,6-diethylerythromycin A 

H H Me H Et Et 2,4.6,10,12-Pentadesmethyl-2,4-diethylerythromycin A 

Et Et Me Et H H 2,4,6,10,12-Pentadesmethyl-6.10.12-triethvlerythromvcin A 

Et Et Me H Et H 2,4,6,10,12-Pentadesmethyl-4,10,12-triethylerythromvcin A 
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Et Et Me H H Et 2,4.6J0.12-PentadesmethYl-240J2-triethylerythromycin A 

Et H Me Et El H 2.4.6J0J2-Pentadesmethyl-4,6J2-triethylerythromycin A 

Et H Me Et H Et 2.4.6.10.12-Pentadesmethyl-2,6.12-triethylerythromycin A 

Et H Me H Et Et 2,4,6.10,12-Pentadesmethyl-2.4J2-triethylerythromycin A 

5 H Et Me Et Et H 2,4.6J0J2-PentadesmethyM,6J0-triethylerythromycin A 

H Et Me Et H Et 2,4,6,1 0.12-Pentadesniethyl-2,6,10-triethyIerythroniycm A 

H Et Me H Et Et 2,4,6,10J2-Pentadesmethyl-2,4.10-triethylerythromycin A 

H H Me Et Et Et 2,4,6,10J2-Pentadesmethyl-2,4,6-triethylerythromycin A 

Et Et Me Et Et H 2,4,6J0J2-Pentadesmethyl-4,6,10,12-tetraethylerythromycin A 

10 Et Et Me Et H Et 2,4,6,10J2-Pentadesmethyl-2,6J0.12-tetraethylerythroniycin A 

Et Et Me H Et Et 2,4,6,10J2-Pentadesmethyl-2,4,10,12-tetraethyierythromycin A 

Et H Me Et Et Et 2,4,6. 10J2-Pentadesmethy 1-2,4,6, 12-tetraethylerythromycin A 

H Et Me Et Et Et 2,4,6, 10 J 2-Pentadesmethy 1-2,4,6, lO-tetraethylerythromycin A 

Et Et Me Et Et Et 2,4,6,10J2-Pentadesmethyl-2,4,6,10,12-pentaethylerythromycin A 

15 H Me H H H H 2,4,6,8, 12-Pentadesmethylerythromycin A 

Et Me H H H H 2,4,6,8, 12-Pentadesmethyl-12-ethylerythromycin A 

H Me Et H H H 2,4,6,8, 12-Pentadesmethyl-8-ethylerythromycin A 

H Me H Et H H 2,4,6,8, 12-Pentadesmethyl-6-ethylerythromycin A 

H Me H H Et H 2,4,6,8, 12-Pentadesmethyl-4-ethylerythroTnycin A 

20 H Me H H H Et 2,4,6,8, 12-PentadesmethyI-2-ethylerythromycin A 

Et Me Et H H H 2,4,6,8, 12-Pentadesmethy 1-8, 12-diethylerythromycin A 

Et Me H Et H H 2,4,6,8,12-Pentadesmethyl-6,12-diethylerythromycin A 

Et Me H H Et H 2,4,6,8, 12-Pentadesmethyl-4, 12-diethylerythromycin A 

Et Me H H H Et 2,4,6,8, 12-Pentadesmethyl-2, 12-diethylerythromycin A 

25 H Me Et Et H H 2,4,6,8, 12-Pentadesmethy 1-6,8-diethylerythromycin A 

H Me Et H Et H 2,4,6,8, 12-Pentadesmethy 1-4,8-diethylerythromycin A 

H Me Et H H Et 2,4,6,8,1 2-Pentadesmethyl-2,8-diethylerythromycin A 

H Me H Et Et H 2,4,6,8, 12-Pentadesmethyl-4,6-diethylerythromycin A 

H Me H Et H Et 2,4,6,8, 12-Pentadesmethy 1-2,6-diethylerythromycin A 

30 H Me H H Et Et 2,4,6,8, 12-Pentadesmethy 1-2,4-diethylerythromycin A 

Et Me Et Et H H 2,4,6,8, 12-Pentadesmethy 1-6,8, 12-triethylerythromycin A 

Et Me Et H Et H 2,4,6,8, 12-Pentadesmethyl-4,8. 12-triethylerythromycin A 

Et Me Et H H Et 2,4,6,8, 12-Pentadesmethy 1-2,8, 12-triethylerythromycin A 

Et Me H Et Et H 2,4,6,8. 12-Pentadesmethy 1-4,6, 12-triethylerythromycin A 

35 Et Me H Et H Et 2,4,6,8, 12-Pentadesmethy 1-2,6, 12-triethylerythromycin A 

Et Me H H Et Et 2,4,6,8,1 2-Pentadesmethyl-2,4,12-triethylerythromycin A 

H Me Et Et Et H 2,4,6,8,1 2-Pentadesmethyl-4,6,8-triethylerythromycin A 

H Me Et Et H Et 2,4,6,8, 12-Pentadesmethyl-2,6,8-triethylerythromycin A 

H Me Et H Et Et 2,4,6,8, 12-Pentadesmethyl-2,4,8-triethylerythromycin A 

40 H Me H Et Et Et 2,4,6,8,1 2-Pentadesmethyl-2,4,6-triethylerythromycin A 

Et Me Et Et Et H 2,4,6,8, 12-Pentadesmethy I-4,6,8-triethylerythromycin A 

Et Me Et Et H Et 2,4,6,8, 12-Pentadesmethy 1-2.6,8, 12-tetraethylerythromycin A 

Et Me Et H Et Et 2,4,6,8,12-Pentadesmethyl-2,4,8,12-tetraethylerythromycin A 

Et Me H Et Et Et 2,4,6,8, 12-Pentadesmethy 1-2,4,6, 12-tetraethylerythromycin A 

45 H Me Et Et Et Et 2.4,6,8, 12-Pentadesmethy 1-2,4,6,8-tetraethylerythromycin A 

Et Me Et Et Et Et 2,4,6,8, 12-Pentadesmethyl-2,4,6,8,12-pentaethylerythromycin A 

Me H H H H H 2,4,6,8, lO-Pentadesmethylerythromycin A 

Me Et H H H H 2,4,6,8,1 0-Pentadesmethyl-lO-ethylerythromycin A 

Me H Et H H H 2,4,6,8, lO-Pentadesmethyl-8-ethylerythromycin A 

50 Me H H Et H H 2,4,6,8, lO-Pentadesmethyl-6-ethylerythromycin A 

Me H H H Et H 2,4,6,8,1 0-Pentadesmethyl-4-ethylerythromycin A 

Me H H H H Et 2,4,6,8, lO-Pentadesmethyl-2-ethylerythromycin A 

Me Et Et H H H 2,4,6,8, 10-Pentadesmethy 1-8. 1 0 diethylerythromycin A 

Me Et H Et H H 2,4,6,8,1 0-Pentadesmethyl-6. 10 diethylerythromycin A 
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2,4.6,8. 10-Penladesmethy 1-4, 10 diethylerythromycin A 
2,4,6,8.1 0-PentadesmethyI-2, 10 diethylerythromycin A 
2,4.6,8, 1 0-Pentadesmethyl-6,8-diethylerythromycin A 
2.4,6,8, 1 0-Pentadesmethyl-4.8-diethy lerythromycin A 
2,4,6,8, 1 0-Pentadesmethyl-2,8-diethylerythromycin A 
2,4,6,8. 1 0-Pentadesmethyl-4.6-diethylerythromycin A 
2,4,6,8, 1 0-Pentadesmethyl-2,6-diethy lerythromycin A 
2,4,6.8,1 0-Pentadesmethyl-2,4-diethyleiythromycin A 
2.4,6.8, 1 0-Pentadesmethyl-6,8, 1 0-triethy lerythromycin A 
10 Me Et El H Et _H 2,4,6,8, 10-Pentadesmethyl-4,8,10-triethy lerythromycin A 

2,4,6,8, 10-Pentadesmethy 1-2,8, 10-triethy lerythromycin A 
2,4,6,8, 1 0-Pentadesmethy 1-4,6, 1 0-triethylerythromycin A 
2,4,6,8, 1 0-Pentadesmethy 1-2,6, 1 0-triethy lerythromycin A 
2,4,6,8, 1 0-Pentadesmethy 1-2,4, 1 0-triethy lerythromycin A 
15 Me H Et Et Et H 2,4,6,8, 1 0-Pentadesmethy l-4,6,8-triethylerythromycin A 

2,4,6,8. 1 0-Pentadesmethy 1-2,6,8-triethylerythromycin A 
2,4,6,8, 1 0-Pentadesmethy 1-2,4,8-triethylerythromycin A 
2,4,6,8, 1 0-Pentadesmethy 1-2,4 ,6-triethylerythromycin A 
2,4,6,8, 1 0-Pentadesmethyl-4,6,8, 1 0-tetraethy lerythromycin A 
20 Me Et Et Et H Et 2,4,6,8, 10-Pentadesmethyl-2,6,8.10-tetraethylerythromycin A 

2,4,6,8, 1 0-Pentadesmethy 1-2,4,8, 1 0-tetraethy lerythromycin A 
2,4,6,8,1 0-Pentadesmethy 1-2,4,6, 10-tetraethy lerythromycin A 
2,4,6,8,1 0-Pentadesmethyl-2,4,6,8-tetraethylerythromycin A 
2,4,6,8,10-Pentadesmethyl-2,4,6,8,10-pentaethylerythromycin A 

25 

F. Six Chanees 

2,4,6,8,1 0, 1 2-Hexadesmethylerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethy 1- 1 2-ethylerythromycin A 
2 ,4,6,8, 1 0, 1 2-Hexadesmethy 1- 1 0-ethy lerythromycin A 
30 H H Et H H 2,4,6,8, 10,12-Hexadesmethyl-8-ethy lerythromycin A 

2,4,6,8, 10,12-Hexadesmethyl-6-ethylerythromycin A 
2,4,6,8. 1 0. 1 2-Hexadesmethy 1-4-ethylerythromycin A 
2,4,6,8. 1 0. 1 2-Hexadesmethyl-2-ethy lerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethy 1- 1 0, 1 2-diethy lerythromycin A 
35 Et H El H H H 2,4,6,8, 10,12-Hexadesmethy!-8,12-diethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethy 1-6. 1 2-diethy lerythromycin A 
2,4,6,8,1 0,12-Hexadesmethyl-4,12-diethylerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-2, 1 2-diethy lerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-8, 1 0-diethylerythromycin A 
40 H Et H Et H H 2,4,6,8, 10,12-Hexadesmethyl-6,10-diethylerythromycin A 

2,4,6,8, 10,1 2-Hexadesmethy 1-4, 10-diethy lerythromycin A 
2,4,6,8,1 0,1 2-Hexadesmethy 1-2,1 0-diethylerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-6,8-diethylerythromycin A 
2,4,6.8,1 0,1 2-Hexadesmethyl-4.8-diethylerythromycin A 
45 H H Et H H Et 2,4,6,8, 10,12-Hexadesmethyl-2,8-diethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethy 1-4,6-diethylerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6-diethylerythromycin A 
2,4,6,8,10, 12-Hexadesmethyl-2.4-diethylerythromycin A 
2,4,6,8, 10,1 2-Hexadesmethyl-8. 1 0, 1 2-triethy lerythromycin A 
50 Et Et H Et H 2,4,6,8, 10,12-Hexadesmethyl-6, 10,1 2-triethy lerythromycin A 

2,4,6,8, 10,1 2-Hexadesmethy 1-4, 10,1 2-triethy lerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-2, 1 0, 1 2-triethy lerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-6,8, 1 2-triethylerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethy 1-4,8, 1 2-triethylerythromycin A 
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Et 


H 


Et 


H 


H 


Et 


2.4.6,8, 1 0 J 2-HexacIesmethyl-2.8. 1 2-triethylerythromycin A 




Et 


H 


H 


Et 


Et 


H 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-4.6, 1 2-triethylerythroinycin A 




Et 


H 


H 


Et 


H 


Et 


2,4,6,8,1 0,1 2-Hexadesmethyl-2,6.1 2-triethylerythromycin A 




Et 


H 


H 


H 


Et 


Et 


2,4.6,8, 1 0, 1 2-Hexadesmethyl-2.4, 1 2-triethylerythromycin A 


5 


H 


Et 


Et 


Et 


H 


H 


2.4.6,8,10,1 2-Hexadesinethyl-6,8,10-triethy!ej7thromycin A 




H 


Et 


Et 


H 


Et 


H 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-4,8. 1 0-triethylerythromycin A 




H 


Et 


Et 


H 


H 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,8, 1 0-triethylerythromycin A 




H 


Et 


H 


Et 


Et 


H 


2,4.6,8, 1 0, 1 2-Hexadesmethy 1-4,6, 1 0-triethylerythromycin A 




H 


Et 


H 


Et 


H 


Et 


2,4,6,8, 1 0,1 2-Hexadesmethyl-2,6. 1 0-triethylerythromycin A 


10 


H 


Et 


H 


H 


Et 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4, 1 0-triethylerythromycin A 




H 


H 


Et 


Et 


Et 


H 


2,4,6,8,1 0,12-Hexadesmethyl-4,6,8-triethylerythromycin A 




H 


H 


Et 


Et 


H 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6,8-triethylerythromycin A 




H 


H 


Et 


H 


Et 


Et 


2,4,6,8, 10,12-Hexadesmethyl-2,4,8-triethyierythromycin A 




H 


H 


H 


Et 


Et 


Et 


2,4.6,8. 10,1 2-Hexadesmethy I-2,4.6-triethylerythromycin A 


15 


Et 


Et 


Et 


Et 


H 


H 


2,4,6.8, 1 0, 1 2-Hexadesmethyl-6,8, 1 0. 1 2-tetraethylerythromycin A 




Et 


Et 


Et 


H 


Et 


H 


2,4.6,8, 1 0, 1 2-Hexadesmethyl-4.8. 1 0. 1 2-tetraethylerythromycin A 




Et 


Et 


Et 


H 


H 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethy 1-2,8, 1 0. 1 2-tetraethylerythromycin A 




Et 


Et 


H 


Et 


Et 


H 


2,4,6,8, 1 0, 1 2-Hexadesmethy 1-4,6, 1 0. 1 2-tetraethylerythromycin A 




Et 


Et 


H 


Et 


H 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6, 10,1 2-tetraethyierythromycin A 


20 


Et 


Et 


H 


H 


Et 


Et 


2,4,6,8, 10,1 2-Hexadesmethy 1-2,4.1 0,1 2-tetraethylerythromycin A 




Et 


H 


Et 


Et 


Et 


H 


2,4.6,8, 1 0, 1 2-Hexadesmethy 1-4.6,8, 1 2-tetraethylerythromycin A 




Et 


H 


Et 


Et 


H 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethy 1-2,6,8, 1 2-tetraethylerythromycin A 




Et 


H 


Et 


H 


Et 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,8, 1 2-tetraethylerythromycin A 




Et 


H 


H 


Et 


Et 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethy 1-2,4,6, 1 2-tetraethylerythromycin A 


25 


H 


Et 


Et 


Et 


Et 


H 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-4,6,8, 1 0-tetraethylerythromycin A 




H 


Et 


Et 


Et 


H 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6,8, 1 0-tetraethylerythromycin A 




H 


Et 


Et 


H 


Et 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,8, 1 0-tetraethylerythromycin A 




H 


Et 


H 


Et 


Et 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,6, 1 0-tetraethylerythromycin A 




H 


H 


Et 


Et 


Et 


Et 


2,4,6,8,10,1 2-Hexadesmethyl-2,4,6,8-tetraethylerythromycin A 


30 


Et 


Et 


Et 


Et 


Et 


H 


2.4,6,8, 1 0, 1 2-Hexadesmethyl-4,6,8, 1 0, 1 2-pentaethy lerythromycin A 




Et 


Et 


Et 


Et 


H 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6,8. 1 0, 1 2-pentaethy lerythromycin A 




Et 


Et 


Et 


H 


Et 


Et 


2,4,6,8. 1 0, 1 2-Hexadesmethy 1-2,4.8- 1 0. 1 2-pentaethy lerythromycin A 




Et 


Et 


H 


Et 


Et 


Et 


2,4,6,8. 1 0, 1 2-Hexadesmethyl-2,4,6, 1 0,1 2-pentaethylerythromycin A 




Et 


H 


Et 


Et 


Et 


Et 


2,4,6,8, 1 0, 1 2-Hexadesmethy 1-2,4 ,6„8, 1 2-pentaethylerythromycin A 


35 


H 


Et 


Et 


Et 


Et 


Et 


2,4,6,8 , 1 0, 1 2-Hexadesmethy 1-2,4,6,8, 1 0-pentaethy lerythromycin A 




Et 


Et 


Et 


Et 


Et 


Et 


2,4,6,8,10,12-Hexadesmethyl-2,4,6,8,10,12-hexaethylerythromycin 



Although in the Exsimples that follow the AT-encoding DNA fragments from S. 
hygroscopicus ATCC 29253, S. venezuelae ATCC 15439, and S, caelestis NRRL-2821 were 

40 used to replace resident AT-encoding DNA fragments in the eryPKS to yield desmethyl, 

desmethylethyU and desmethylhydroxyerythromycins, it is understood that many malonate, 
ethylmalonate, and hydroxymalonate AT-encoding DNA fragments can be used in place of or 
in addition to the heterologous malonate, ethylmalonate, and hydroxymalonate- AT DNA 
fragments described herein to produce the same desmethyl, desmethylethyl, and 

45 desmethylhydroxyerythromycin compounds. Examples of DNA fragments encoding 
malonate-AT domains that can be used in place of or in addition to those specifically 
described in the Examples below include but are not limited to the DNA fragments encoding 
AT domains from modules 2, 5, 8, 9, 1 1, or 12 of the rapamycin PKS genes from S. 
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hygroscopicus, the AT domain from module 2 of the PKS responsible the synthesis of 
methymycin or pikromycin by S. venezuelae^ the AT domains from modules 3 and 7 of the 
PKS responsible for the synthesis of tylosin by S. fradiae, or the AT domains from modules 
1, 2, 3 and 7 of the PKS responsible for the synthesis of spiramycin by S. ambofaciens, 
5 Examples of DNA fragments encoding ethylmalonate-AT domains that can be used in place 
of or in addition to those specifically described in the Examples below include but are not 
limited to the DNA fragments encoding the AT domain from module 5 of the spiramycin 
PKS genes from & ambofaciens, the AT domain from module 5 of the tylosin PKS genes 
from S.Jradiae, and the AT domain from module 5 of the maridomycin PKS genes of S 

1 0 hygroscopicus. Examples of DNA fragments encoding hydroxymalonate-AT domains that 
can be used in place of or in addition to those specifically described in the Examples below 
include but are not limited to the DNA fragments encoding the AT domain from module 6 of 
the spiramycin PKS genes from S. ambofaciens^ the AT domain from module 6 of the 
maridomycin PKS genes from S. hygroscopicus^ and the AT domain from module 6 of the 

15 leucomycin PKS genes from Streptoverticillium kitasatoensis. Thus the use of any and all 

DNA fragments encoding malonate, ethylmalonate, and hydroxymalonate-ATs to replace any 
of the resident DNA fragments encoding methylmalonate-ATs in the eryPKS genes to result 
in the production of novel derivatives of erythromycin are considered within the scope of the 
present invention. 

20 Furthermore, whereas the NidAT6 domain was exemplified herein to replace the AT 

domains of the starter, module 1 or module 2 in the eryPKS to introduce hydroxyl groups into 
positions 14, 12 and 10, respectively of the polyketide backbone of erythromycin, it is 
xmderstood that the NidAT6 domain can also be used to replace the AT domains of modules 
3, 4, 5 or 6 of the eryPKS to result in the production of erythromycin derivatives containing a 

25 hydroxyl group at position 8, 6, 4 or 2, respectively, of the erythromycin backbone to replace 
the methyl group that is normally seen at the corresponding position. Therefore, all 
compoimds produced from the replacement of an eryAT domain with the NidAT6, including 
the compounds 8-desmethyl-8-hydroxyerythromycin A, 6-desmethyl-6-epierythromycin A, 
4-desmethyl-4-hydroxyerythromycin A and 2-desmethyl-2-hydroxyerythromycin A or their 

30 6-deoxy derivatives and the corresponding strains that produce them are included under the 
scope of this invention. 

Furthermore, whereas the NidAT6 domain was exemplified herein to replace a single 
AT domain in the eryPKS to produce a derivatized erythromycin A molecule containing a 
single additional hydroxyl group, those skilled in the art understand that it is possible to 

35 independently replace two or more AT domains of the eryPKS with the NidAT6 domain to 

obtain derivatized erythromycins v^th two or more additional hydroxyl groups. Examples of 
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erythromycin molecules containing two additional hydroxyl groups include, but are not 
limited to, 2,12-didesmethyl-2,12-dihydroxyerythromycin, 4,10-didesmethyl-4,10- 
dihydroxyerythromycin, and the like. Therefore, all compounds produced from the 
replacement of two or more AT domains of the eryPKS with NidAT6 and the corresponding 
5 strains that make them are included under the scope of this invention. 

It is also understood by those skilled in the art that the placement of the NidAT6 
domain at more than a single position in the eryPKS may result in the genetic instability of 
the hybrid PKS DNA due to homologous recombination that can take place between the 
NidAT6-encoding sequences. To avoid this recombination event, those skilled in the art will 

10 recognize the necessity to introduce changes in the NidAT6 DNA sequence to make a series 
of niodified NidAT6 DNA domains that differ in DNA sequence from each other and from 
the natural NidAT DNA but which still encode a functional domain that can be used to 
replace a methyl group with an hydroxyl group in erythromycin. Such derivatives of 
NidAT6, when used in combination with NidAT6 or with each other to make two or more 

15 AT replacements will render the hybrid PKS stable to mutation through homologous 

recombination. The methods for making such modifications are well known to those of 
ordinary skill in the art. Thus all derivatives of NidAT6 that encode a functional domain that 
can be used to introduce hydroxyl groups into erythromycin are included in the scope of this 
invention. 

20 It is also understood that the NidAT6 domain can be used in combination with other 

heterologous malonyl AT or ethylAT domains to introduce chemical diversity into the 
erythromycin backbone. For example, the NidAT6 domain can be used to replace the 
eryAT2 domain in Sac, erythraea strain ER720 EryATl/LigATl which itself has the eryATl 
domain replaced by a malonyl AT domain from a PKS from Streptomyces hygroscopicus to 

25 result in the production of the compound 10,12-didesmethyl-lO-hydroxyerythromycin. 

Similarly, the NidAT6 domain can be used to replace the eryAT2 domain in Sac. erythraea 
strain ER720 EryAT4/NidAT5 that itself has the eryAT4 domain replaced by an ethyl AT 
domain from the Nid PKS to result in the production of the compound 6,10-didesmethyl-6- 
ethyl-lO-hydroxy erythromycin A. Therefore, all compounds produced from the substitution 

30 of two or more AT domains from the eryPKS with any combination of AT domains that 

encode malonyl, ethyl or hydroxymalonyl starter domains and their corresponding strains that 
produce them are included imder the scope of this invention. 

Furthermore, those skilled in the art will understand that the NidAT6 domain can be 
used in gene replacements in srmG, tylG, the rifPKS DNA, the rapPKS DNA, or other 

35 modular PKS genes, to introduce hydroxyl groups in spiramycin, tylosin, rifamycin, 

rapamycin or other reduced polyketides. Therefore, the \ise of NidAT6, or any other AT 
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domain that specifies a hydroxymalonyl starter domain, such as the AT domains from the 
sixth modules of the carbomycin PKS, the midecamycin PKS or maridomycin PKS, as 
examples, to introduce hydroxyl groups at one or more positions in the polyketide backbones 
of erythromycin, spiramycin, rifamycin, rapamycin or any other polyketide that employs a 
5 modular PKS for its assembly is included under the scope of this invention. In addition, the 
use of NidAT6, or any other AT segment that specifies a hydroxymalonyl starter domain, in 
combination with LigAT2, or any other segment that specifies a malonyl starter domain, or in 
combination with NidATS, or any other AT segment that specifies an ethylmalonyl starter 
domain, to make two or more replacements in the eryA, srmG, tylG or any other modular 
1 0 PKS to introduce chemical diversity into erythromycin, spiramycin, tylosin or other 

polyketides that employ a modular PKS for their synthesis is included under the scope of this 
invention. 

Whereas a 3.0 kb segment of the rapA gene from Streptomyces hygroscopicus ATCC 
29253 encoding the rapligase and adjacent ERS domains is exemplified herein to replace the 

1 5 ATs domain of the eryPKS to yield a hybrid PKS that encodes the production of the 
compoundl3-desethyl-13-(3\4'-dihydroxycyclohexyl)methylerythromycin A, it is 
understood that several other gene replacements using longer segments of the rapA gene may 
be used in place of the 3.0 kb segment in analogous gene replacement experiments to create 
strains that yield the same product. Examples of longer segments include but are not limited 

20 to those that contain the rapligase - ERS segment and the adjacent ACP domain ofrapA that 
can be used to replace the ATs - ACPs segment of the eryPKs and those that contain the 
rapligase - ERS segment and the adjacent ACP - KSl -encoding segment of rapA to replace 
the ATs - ACP - KSl segment of the eryPKS. Thus, all segments of rapA that can be used in 
gene replacements with the ery^/gene to result in the synthesis of 13-desethyl-13-(3',4'- 

25 dihydroxycyclohexyl)methylerythromycin A, along with the strains that produce 1 3-desethyl- 
13-(3%4'-dihydroxycyclohexyl)methylerythromycin A are included under the scope of the 
present invention. 

In addition, whereas the production of 13-3,4-dihdroxycyclohexylerythromycin A in 
Sac. erythraea EryATs/rapligase 3.0 was dependent upon the feeding of th^ compoxmd 3,4- 

30 dihydroxycyclohexylcarboxylic acid to the culture medium, those skilled in the art 

understand that various salts and esters of 3,4-dihydroxycyclohexylcarboxylic acid can be 
used in place of 3,4-dihydroxycyclohexylcarboxylic acid to yield 13-desethyl-13-(3',4'- 
dihydroxycyclohexyOmethylerj^thromycin A. Furthermore, derivatives of 3,4- 
dihydroxycyclohexylcarboxylic acid or its corresponding salts or esters can also be fed to 

35 Sac. erythraea EryATs/rapligase 3.0 in place of 3,4-dihydroxycyclohexyIcarboxyIic acid or 
its salts or esters to result in the production of derivatives of 13-desethyl-13-(3^4'- 
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dihydroxycyclohexyl)methylerythromycin A. Examples of derivatives of 3,4- 
dihydroxycyclohexylcarboxylic acid or its salts or esters that can be fed to Sac. erythraea 
EryATs/rapligase 3.0 include, but are not limited to, 3-hydroxycyclohexylcarboxylic acid, 4- 
hydroxycyclohexylcarboxylic acid, shikimic acid, 3-methoxy-4- 
5 hydroxycyclohexylcarboxylic acid, and the like to yield 1 3-desethyl-l 3-(3'- 
hydroxycyclohexyl)methylerythromycin A, 1 3-desethyl-l 3 -(4'- 
hydroxycyclohexyl)methylerythromycin A, 13-desethyl-13-(3%4%5'- 
trihdroxycyclohexyI)methylerythromycin A, 1 3-desethyl-13-(3'-methoxy-4'- 
hydroxycyclohexyl)methylerythromycin A, and the like, respectively. Therefore, all 
1 0 derivatives of 1 3-desethyl-l 3-(3 ',4' -dihydroxycyclohexyl)methy [erythromycin A that can be 
produced by the feeding of derivatives of 3,4-dihydroxycyclohexylcarboxylic acid to Sac. 
erythraea EryATs/rapligase 3.0 are included within the scope of the present invention. 

Furthermore, those of ordinary skill understand that following the methods described 
herein for replacement of resident AT-encoding DNA fragments in the eryPKS, the DNA 
1 5 fragments encoding malonate-ATs in S. hygroscopicus, S. venezuelae, or S. caelestis^ and 
ethylmalonate or hydroxymalonate- ATs in S. caelestis may be replaced with those AT- 
encoding DNA fragments from the eryPKS which utilize methylmalonyl CoA as a substrate. 
As with the eryPKS, all combinations are contemplated, leading to the production of, for 
example, 1 3-methylrapamycin, 15-methylrapamycin, 33-methylrapamycin, 13,15- 
20 dimethylrapamycin, 13,15,33-trimethylrapamycin, and lO-methylpikromycin. 

The methods of the present invention are widely applicable to all erythromycin- 
producing microorganisms, of which a non-exhaustive list includes Saccharopolyspora 
species, Streptomyces griseoplanus, Nocardia sp., Micromonospora sp., Arthrobacter sp. and 
Streptomyces antibioticiis. Of these. Sac. erythraea is the most preferred. Other hosts, which 
25 normally do not produce erythromycin but into which the erythromycin biosynthesis genes 
can be introduced by cloning, can also be employed. Such strains include but are not limited 
to Streptomyces coelicolor and Streptomyces lividans or Bacillus subtilis, as examples. In 
each of the other erythromycin-producing strains, replacement of the resident AT domains in 
the erythromycin PKS is conducted by double homologous recombination using cloned 
30 eryPKS sequences on both sides of the AT domain to be replaced to effect the switching of 
the resident AT with a heterologous AT as illustrated in the Examples that follow. 

Many other variations of the methods that are illustrated in the Ex£miples that follow 
will occur to those skilled in the art. For example, whereas the plasmids pUCl 8, pUCl 9, 
pGEM3Zf, and pCS5 were employed in the present invention for the cloning of the LigAT2, 
35 venAT, rapAT14, NidAT5, or NidAT6-encoding DNA fragments and constmction of the 
integration vectors, other plasmids, phage, or phagemids including but not limited to 
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pBR322, pACYC184, MlSmplS, M13mpl9, pGEM7Zf and the like can be used in their 
place to allow the same constructions to be made. Furthermore, m£my alternative strategies 
can be followed for the cloning of the heterologous AT-encoding DNA fragments into 
integration vectors that enable homologous recombination to occur in corresponding regions 
5 of the eryPKS. Examples of alternative strategies include the use of longer or shorter 

fragments of DNA corresponding to either the AT domains or the flanking sequences, using 
different restriction sites for the cloning of the AT domains or the adjacent flanking 
sequences, or changing the sequence of a resident AT-encoding DNA fragment so that it 
expresses a domain which recognizes malonyl CoA as a substrate rather than methylmalonyl 

10 CoA. All such variations are within the scope of the present invention. Similarly, employing 
altemative strategies to introduce DNA into Sac. erythraea or other erythromycin-producing 
hosts for the purpose of effecting gene exchange to result in the production of novel 
erythromycins, such as conjugation, transduction or electroporation are also included within 
the scope of the present invention. 

15 Those skilled in the art also understand that erythromycins B, C and D are naturally 

occurring forms of erythromycin and therefore would be produced zs novel derivatives in 
Sac. erythraea by the modifications disclosed herein. Production of these forms may be 
further enhanced by inactivation of eryK (Stassi, D. et aL J. Bacteriology , 175:182-189, 
(1993)) to yield erythromycin B derivatives, eryG (S. F. Haydock et al. Mol Gen. Genet, 

20 230: 1 20- 1 28( 1 99 1 )) to yield erythromycin C derivatives and eryK and eryG to yield 

erythromycin D derivatives. Furthermore, in Sac, erythraea^ 6-deoxy forms of the novel 
erythromycins A, B, C and D can be generated by inactivation oieryF (J. M. Weber et al. 
Science 252:1 14-1 17(1991)) (in addition to those specified above), which encodes the 
hydroxylase responsible for hydroxylating the C-6 position. In addition, conversion of 6- 

25 deoxy forms of the novel erythromycins A, B, C and D to their corresponding erythromycin 

A, B, C, and D derivatives may be accomplished by cloning additional copies or by 
employing other means of overexpression of the eryF gene in the production host. Similarly, 
conversion of novel forms of erythromycins B, C and D to novel forms of erythromycin A 
may be achieved by expressing or overexpressing eryK and/or eryG in the production host. 

30 The methodologies for generating erythromycins B, C zind D and 6-deoxy erythromycins A, 

B, C and D are well known to those of ordinary skill in the art. 

Those skilled in the art also understand that erythronolide B and 3-a- 
mycarosylerythronolide B are naturally occurring intermediates in the biosynthesis of 
erythromycin and therefore would be produced as novel intermediates in Sac. erythraea by 
35 the modifications disclosed herein. Production of these forms may be further enhanced by 
inactivation of any of the eryB genes to yield erythronolide B or eryC genes to yield 3-a-L- 
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mycarosylerythronolide B (Weber et al. J, Bacteriol 172:2372-2383 (1990)) and Haydock et 
al. Mol Gen. Genet 230:120-128 (1991)). Furthermore, 6-deoxy forms of these novel 
intermediates can be generated by inactivation of eryF as described above. The 
methodologies for generating erythronolide B and 3-a-mycarosylerythronolide B, as well as 
their 6-deoxy derivatives, are well known to those of ordinary skill in the art. 

Bacterial Strains, Plasmid Vectors, and Grov^h Media 
The erythromycin-producing microorganism used to practice the following examples 
of the invention was Sac. erythraea ER720 (J.P. DeWitt, J. Bacteriol 164: 969 (1985)). The 
host strain for the growth of E. coli derived plasmids was DH5a from GIBCO BRL, 
Gaithersburg, MD). The S. hygroscopicus strain that carries the Lig-PKS cluster is available 
from the American Type Culture Collection , Bethesda, MD under the accession number 
ATCC 29253. The S. venezuelae strain that carries the venAT domain described herein is 
available from the American Type Culture Collection , Bethesda, MD under the accession 
number ATCC 15439. 

E. coli bacteria carrying pUC18/venAT has been deposited at the Agricultural 
Research Culture Collection (NRRL), 1815 N. University Street, Peoria, Illinois 61604 
U.S.A., as of December 23, 1996, under the terms of the Budapest Treaty and will be 
maintained for a period of thirty (30) years from the date of deposit, or for five (5) years after 
the last request for the deposit, or for the enforceable period of the U.S. patent, whichever is 
longer. The deposit and any other deposited material described herein are provided for 
convenience only, and are not required to practice the present invention in view of the 
teachings provided herein. The DNA sequence in all of the deposited material is incorporated 
herein by reference. E. coli bacteria carrying pUCl 8/venAT was accorded NRRL Deposit 
NoB-21652. 

Plasmid pUCl 8 and pUC19 can be obtained from GIBCO BRL. Plasmid pCS5, a 
multiftinctional vector for integrative transformation of Sac. erythraea is described in Vara, 
etal.J. Bacteriology, 171:5872-5881 (1989) and is referred to therein as pWHM3. Cosmid 
pNJl is described in Tuan, et al. Gene, 90: 21-29 (1990). 

Sac. erythraea was grovsoi for protoplast formation and routine liquid culture in 50 mL 
of SGGP medium (Yamamoto, et al, J. Antibiotic. 39:1304 (1986)), supplemented with 10 
\x% of thiostrepton/nnL for plasmid selection where appropriate. 

Reagents and General Methods 
Commercially available reagents were used to make compoimds, plasmids and genetic 
variants of the present invention, including butyric acid, ampicillin, thiostrepton, restriction 
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endonucleases, T4-DNA ligase, and calf intestine alkaline phosphatase. The nucleotide 
sequence of the eryA genes from Sac, erythraea has been deposited in the GenBank database 
under the accession numbers M63676 and M63677 and are publicly available. 

Standard molecular biology procedures (Maniatis et al^ supra) were used for the 
5 construction and characterization of replacement plasmids. Plasmid DNA was routinely 

isolated by the alkaline lysis method (H. C. Bimboim and J. Doly, 1979 Nucleic Acids Res. 
7: 1 5 1 3) or with QIAprep Spin Plasmid kit (Qiagen, Inc., Chatsworth, CA) according to the 
manufacturers instructions. Restriction fragments were recovered from 0.8-1% agarose gels 
with Prcp-A-Gene (BioRad). The products of ligation for each step of the plasmid 

10 constructions were used to transform the intermediate host, E. coli DH5a (GIBCO BRL), 

which was cultured in the presence of Emipicillin to select for host cells carrying recombinant 
plasmids. Selection for insert DNA with X-gal was used where appropriate. Typically, LB 
plates contain 30 mL of LB agar (Maniatis et aL, supra), Plasmid DNAs were isolated from 
individual transformants that had been grown in liquid culture and characterized with respect 

1 5 to known restriction sites. DNA sequence determination was by cycle sequencing (ftnol 
DNA Sequencing System, Promega Corp. Madison, WI) according to the manufacturer's 
instructions. 

SCM medium consists of 20 g Soytone, 15 g Soluble Starch, 10.5 g MOPS, 1.5 g 
Yeast Extract and 0.1 g CaCl2 per liter of distilled H2O. SGGP medium consists of 4 g 

20 peptone, 4 g yeast extract, 4 g casamino acids, 2 g glycine, 0.5 g MgS04« 7 H2O, 10 g 

glucose, 20 niL of 500 mM KH2PO4 per liter of aqueous solution (Yamamoto, et al, 1986, J. 
Antibiotic. 39:1304). Pm buffer (per liter) is 200 g sucrose, 0.25 g K2SO4 in 890 mL H2O, 
with the addition after sterilization of 100 mL 0.25 M TES, pH 7.2, 2 mL trace elements 
solution (Hopwood, et aly 1985, Genetic Manipulation of Streptomyces A Laboratory 

25 Manual, The John Innes Foundation), 0.08 mL 2.5 M CaCl2, 1 0 mL 0.5% KH2PO4, 2 mL 
2.5 M MgCl2. 

Integrative transformation of Sac. erythraea protoplasts, and routine growth and 
sporulation were carried out according to procedures described in Donadio, et al., 1991 , 
Science 115:97; Weber and Losick, 1988, Gene 68:173; and Yamamoto, et aL, 1986, J. 
30 Antibiotic. 39:1304. 

Oligo primers used in the PGR amplifications and described in the Examples below 
are as follows: 



5 ' - ATCTACACSTCSGGCACSACSGGCAAGCCSAAGGG- 3 ' 
5 ' -CTSAAGGCSGGCGGCGCSTACGTSCCSATCGACCC-3 ' 
5 ' - CGCGAATTCCTAGGCTGGCGGTGATGTTCA- 3 ' 
5' -GCCGGATCCATGCATACGTCGGCAGGGAGGTAC-3 ' 



SEQ ID NO: 3 
SEQ ID NO: 4 
SEQ ID NO: 5 
SEQ ID NO: 6 
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5' 


-GCTCGAATTCGCTGGTCGCGGTGCACCT- 3 ' 


SEQ 


ID 


NO: 


7 


5' 


-GACGGATCCGGCCCTAGGCTGCGCCCGGCTCG- 3 ' 


SEQ 


ID 


NO: 


8 


5' 


-TTGGGATCCTATGCATTCCAGCGCGAGCGC- 3 ' 


SEQ 


ID 


NO: 


9 


5' 


-GAGAAGCTTGGCGCGACTTGCCCGCT- 3 ' 


SEQ 


ID 


NO: 


10 


5' 


-TTTTTTAAGCTTGGTACCTGCTCACCGGCAACACCG- 3 ' 


SEQ 


ID 


NO: 


11 


5' 


-TTTTTTGGATCCCTGCAGCCTAGGGTCGGAGGCACTGCCGGT- 3 ' 


SEQ 


ID 


NO: 


12 


5' 


-TTTTTTCTGCAGTATGCATTCCAGGGCAAGCGGTTCT- 3 ' 


SEQ 


ID 


NO: 


13 


5' 


-TTTTTTGAATTCACGCGTTGCCCGCGGCGTAGGCGC- 3 ' 


SEQ 


ID 


NO: 


14 


5' 


-GATCGAATTCCCTAGGACGGCAGTCCTGCTCACC- 3 ' 


SEQ 


ID 


NO: 


15 


5' 


-GATCGGATCCATGCATACGTCGGAAGGTCGACCCG- 3 ' 


SEQ 


ID 


NO: 


16 


5' 


-TTCGAAGAATTCCCTAGGGTTGCCTTCCTGTTCGAC- 3 ' 


SEQ 


ID 


NO: 


17 


5' 


-TTCGAAAAGCTTATGCATAGACCGGCAGATCCACCG- 3 ' 


SEQ 


ID 


NO: 


18 


5' 


-CGGTSAAGTCSAACATCGG-3 ' 


SEQ 


ID 


NO: 


19 


5' 


-GCRATCTCRCCCTGCGARTG-3 ' 


SEQ 


ID 


NO: 


20 


5» 


-GAGAGAGGAACCAACGCGCACGTGATCGTCGAAGAGGCACCAGC- 3 ' 


SEQ 


ID 


NO: 


21 


5' 


-GAGAGAGGATCCGACCTAGGCGCGGAGGTCACCGGCGCGACGGCG- 3 ' 


SEQ 


ID 


NO: 


22 


5 • 


-GAGAGACCTAGGAAGCCGGTGTTCGTGTTCCCCGGCCAGGGCT- 3 ' 


SEQ 


ID 


NO: 


23 


5' 


-GAGAGAGGATCCGAGGCCGGCCGTGCGCCCGGACCGAAGACCGCCTC- 3 • 


SEQ 


ID 


NO: 


24 


5' 


-GAGAGAATTCCCTAGGGTCGCCTTCGTCTTTCCCGGGCAGG- 3 ' 


SEQ 


ID 


NO: 


25 


5* 


-TTGAGATCTTATGCATACGAGGGAAGCGGCACCCTGC-3 ' 


SEQ 


ID 


NO: 


26 


5 ' 


-TTTGAATTCACGTCCTCGACGTGCAGCA-3 ' 


SEQ 


ID 


NO: 


35 


5 ' 


-TTTGGATCCCCTAGGGGACGGCCGGGCCACGCC- 3 • 


SEQ 


ID 


NO: 


36 


C 1 

D 




b£Q 




NO : 


J / 


5 • 


-TTTAAGCTTGCGCCCGCCCGTTGGGC-3 ' 


SEQ 


ID 


NO: 


38 


5' 


-ATGGCTTCCGACAGTCCCCGCCCAAGGCCG -3 


SEQ 


ID 


NO: 


39 


5' 


- ACCAATTCCGTCGGCGGGCACCAGGCCACC -3' 


SEQ 


ID 


NO 


40 


5' 


- TTTTGAATTCCCTAGGATGTCACGCGCGGAACTGG - 3 ' 


SEQ 


ID 


NO- 


41 


5' 


-TTTTGCATGCGTCAGTGCGAGCCG -3' 


SEQ 


ID 


NO 


:42 


5' 


- TTTTCTCGAGGTCGGCCCGGAAGT - 3 ' 


SEQ 


ID 


NO 


:43 


5' 


- TTTTAAGCTTATGCATGTCGAGTCGCCGGGGAATGG - 3 ' 


SEQ 


ID 


NO 


:44 



Mass spectrometry was routinely performed with a Finnigan-MAT 7000 mass 
spectrometer equipped with an atmospheric pressure chemical ionization source (APCI). 
Electrospray mass spectrometry (ESI-MS) was performed with a Finnigan-MAT 752-7000 

5 mass spectrometer equipped with a Finnigan atmospheric pressure ioni2ation (API) source. 
HPLC separation was carried out on a Hewlett-Packard 1 050 liquid chromatograph using a 
Prodigy ODS (2) column (5^m, 50x2mm) and a gradient eiution of 5mM ammonium acetate 
and methanol. The flow rate was 0.3 mL/min. 

For large scale preparation of erythromycin derivatives, fermentation beers are 

10 typically adjusted to pH 9 with NH4OH and then extracted two times with an equal volume 
of CH2CI2. The pooled extract is then concentrated to a wet oil (approx. 1 g per liter of 
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fermentation beer). Concentrated extracts are digested in methanol and chromatographed 
over a colunm of Sephadex® LH-20 (Pharmacia Biotech, Uppsala, Sweden) in the same 
solvent. Fractions are tested for bioactivity against Staphylococcus aureus, and active 
fractions are combined and concentrated. When additional column chromatography is 
5 desired to reduce sample weight, the concentrated sample is digested in a solvent system 
consisting of n-heptane, chloroform, ethanol (10:10:1, v/v/v) and chromatographed over a 
column of Sephadex® LH-20 in the same system. Fractions are then analyzed by NMR, 
focusing on the characteristic erythromycin reson£mces around 5 = 5.0 (H-13), 6 = 4.9 (H-l"), 
and 6 = 4.4 (H-V) (Everett and Tyler, J. Chem. See. Perkin Trans. I, pg. 2599 (1985)) and 
10 pooled according to purity. Alternatively, column chromatography is replaced with an 
extraction sequence. In this case, the initial pooled CH2CI2 extract is concentrated to 

approximately 400 mL. This is extracted twice with equal volumes of 0.05 M aqueous 
potassium phosphate with the pH chosen between pH 4.5-6. The aqueous phase is then 
pooled, adjusted to pH 8-9, and extracted twice with equal volumes of ethyl acetate. Finally, 

15 the ethyl acetate extracts are pooled and concentrated. When additional reduction in sample 
weight is desired, the extraction sequence is repeated on a 10-50 fold smaller scale, typically 
yielding about 500 mgs of partially puure material. 

High resolution separation of erythromycin derivatives is obtained by one or more 
rounds of countercurrent chromatography (Hostettmann and Marston, Anal Chim, Acta^ 

20 236:63-76 (1990)). When the weight of the partially pure sample from colunm 

chromatography or the extraction sequence is less than 5 g, but greater than 0.5 g, it is 
digested in 7 mL of the upper phase of a solvent system (3:7:5, v/v/v) consisting of n-hexane, 
ethyl acetate, 0.02 M aqueous potassium phosphate, with a pH chosen between 6.5-8.0, and 
chromatographed on a custom droplet countercurrent chromatography (DCCC) instrument 

25 [100 vertical columns, 0.4 cm dia. x 24 cm length; Hostettmann and Marston, Anal. Chim. 

Acta, 236:63-76 (1990)] in the same system with the upper phase as the mobile phase. Flow 
rates of approximately 120-200 mL/hr are employed. As before, fractions are analyzed by 
NMR and bioactivity, and pooled according to purity. When the weight of the partially pure 
sample is approximately 0.5 g or less, countercurrent chromatography is carried out on an Ito 

30 multi-layered horizontal Coil Planet Centrifuge (P.C. Inc., Potomac, MD) using either the 

system consisting of n-hexane, ethyl acetate, 0.02 M aqueous potassium phosphate, with the 
pH chosen between 6.5-8.0, (3:7:5, v/v/v) employed above, or similar systems in which the 
ratio of hexane to EtOAc and/or the pH are varied. The chromatography is developed either 
isocratically, or with a gradient starting, for example, with the upper phase of a solvent 

35 system consisting of h-hexane, ethyl acetate, 0.02 M aqueous potassium phosphate, with the 
pH chosen between 6.5-8.0, (7:3:5, v/v/v) and finishing with the upper phase of a solvent 
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system consisting of n-hexane, ethyl acetate, 0.02 M aqueous potassium phosphate, at the 
same pH, (1:1:1, v/v/v). In all cases, flow rates of approximately 120 mL/hr are employed. 
As before, fractions are analyzed by NMR and bioactivity, and pooled according to purity. 
Once sufficient purity is achieved, and ^^C NMR spectra are measured with a General 
5 Electric GN500 spectrometer and structural assignments are made with the aid of with the aid 
of correlational spectroscopy (COSY), heteronuclear multiple quantum correlation (HMQC), 
heteronuclear multiple bond correlation (HMBC), and distortionless enhancement by 
polarization transfer (DEPT) experiments. 

The foregoing can be better understood by reference to the following examples, which 
10 are provided as non-limiting illustrations of the practice of the instant invention. 

EXAMPLE 1 : Cloning of the LigAT2 Domain from 
Strevtomvces hv^roscopicus ATCC 29253 
A genomic library of Streptomyces hygroscopicus ATCC 29253 DNA was 

15 constructed in the bifunctional cosmid pNJl (Tuan, et al^ Gene 90: 21-29 (1990)) using 

standard methods of recombinant DNA technology. Briefly, cosmid vector was prepared by 
digesting approximately 5 jig of pNJI with EcdRli dephosphorylating with calf intestinal 
alkaline phosphatase (CIAP) and then digesting with BgRl to generate one arm and also 
digesting 5 |ig of pNJl with //mdlll, dephosphorylating with CIAP and then digesting with 

20 BgRl to generate the other. Insert DNA was prepared by partially digesting approximately 25 
Hg of high molecular weight S. hygroscopicus chromosomal DNA with SaulllA according to 
the procedure outlined in Maniatis, et al. supra. SaulWA fragments of approximately 35 kb 
were recovered from a 0.5% low melting point agarose gel by melting the appropriate gel 
slice to 65'^C, adding 3 volumes of TE buffer, gently extracting 2X with phenol and once with 

25 chloroform and ethanol precipitating the aqueous phase. For the ligation, approximately 3 |ig 
of this chromosomal DNA was mixed with approximately 0.5 |ig of each cosmid arm and 
EtOH precipitated. The precipitate was resuspended in 7 p.L of water to which was added 2 
^iL of 5X ligation buffer and 1 jaL of T4 DNA ligase. The mixture was incubated overnight 
at 16**C. Gigapackll XL (Stratagene®) was used for packaging 2 |iL of the ligation mix 

30 according to the manufacture's instructions. The host bacterium was E. coli ER1772 from 
New England Biolabs (Beverly, MA). Twenty-six colonies were examined by restriction 
analysis and all were foimd to contain insert DNA. Individual colonies were picked into 
thirty-four 96-well plates to give a 99.99% probability that the library represented all S. 
hygroscopicus sequences. Further restriction analysis demonstrated the average insert size to 

35 be about 30 kb. 
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The library was screened with a 1 .45 kb Sstl-Mscl DNA fragment encompassing the 
ketosynthase (KS) domain from module 5 of the erythromycin PKS gene eryAIII (Donadio 
and Katz, 1992, Gene, Hi: 51-60). The DNA fragment was labeled with 32p using the 
Megaprime DNA labeling system (Amersham Life Science, Arlington Heights, IL). Colonies 
5 (3600) were transferred from 96-well plates to Hybond-N nylon membranes (Amersham Life 
Science, Arlington Heights, IL) and probed according to procedures outlined in Maniatis, et 
al supra. Hybridization was performed at 65°C and a stringency wash carried out with O.lx 
SSC at GS'^C. About 60 cosmid clones were chosen which gave the strongest signals with this 
PKS probe. 

10 We also decided to screen Southern digests of these clones with a second probe in 

order to identify potential genetically linked peptide synthetases in this strain. The probe was 
designed from conserved motifs of nonribosomal peptide synthetases (Borchert et al, 1992, 
FEMS Microbiology Letters, 92: 175-180) and consisted of a mixture of two degenerative 
35-mers, SEQ ID NO:3 and SEQ ID NO:4. The mixed probe was labeled using DNA 5* End 

15 Labeling System (Promega Corp., Madison, WI). The 60 cosmid clones were digested with 
Smal and mn on 0.9% agarose gels. Southern analysis was performed according to Maniatis, 
et al supra, except that hybridization was overnight at 55°C and the stringency wash was 
with 0.5x SSC at 55''C. Two cosmids, 54 and 58, were identified using this second probe. 
Thirteen additional cosmids were subsequently isolated by re-probing the cosmid library with 

20 a Ikb fragment from the left of the insert of cosmid 58. Two of these thirteen cosmids, 
designated A15 and A 16, were then further analyzed by restriction analysis and DNA 
sequencing. Restriction and sequence analysis of a 32.8 kb continuous segment of DNA 
from Al 6 revealed a type I PKS cluster with four PKS modules. A genetic map of the cluster 
is shown in FIG. 6. Since an unusual CoA ligase-like domain was found in ORFl (PKSl), 

25 the cluster was named "Lig-PKS". 

The nucleotide sequence of the LigAT2 domain from Lig-PKS (top strand) and its 
corresponding amino acid sequence (bottom strand) are shown in FIG. 7 (SEQ ID NO:l and 
SEQ ID NO:31 respectively). When SEQ ID NO:31 was compared with the 14 AT domains 
in the rapamycin PKS (Growtree Progreim, GCG, Madison WI), it was found to cluster with 

30 malonate-specifying rapamycin domains (see Growtree analysis of FIG. 3). Therefore, it was 
predicted that the LigAT2 specifies malonate as its cognate extender unit during synthesis of 
the polyketide encoded by Lig-PKS. 



EXAMPLE 2: Construction of plasmid oUCl 8/LigAT2 
35 Two PGR oligonucleotides (SEQ ID NO:5 and SEQ ID NO:6) were designed to 

subclone the 985 bp DNA segment encoding the LigAT2 domain from the Lig-PKS cluster 
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and to introduce two unique restriction sites, Avrll and Nsil, for cassette cloning. The unique 
restriction sites Avrll and Nsil required for cassette cloning of the AT-encoding DNA were 
chosen based on multiple sequence alignment using the programs PILEUP and PRETTY 
(GCG, Madison WI) which compared the amino acid sequences of LigAT2, venAT, rapAT2, 
5 rapATS, rapATS, rapAT9, rapATl 1 , rapATl 2, rapATl 4, eryATl , eryAT2, eryAT3, eryAT4, 
eryAT5, eryAT6, and a monofunctional AT from Streptomyces glaucescens (R.G. Summers 
et al., Biochemistry 34:9389-9402 (1995)). The selection and positioning of the restriction 
enzyme sites were based on the following considerations: (i) extent of amino acid sequence 
conservation among the various ATs, with the sites being positioned outside, but near the 
1 0 regions of greatest conservation, (ii) absence of the sites from the heterologous AT-encoding 
DNA and the eryAT flanking DNA and (iii) impact of the amino acid sequence changes 
resulting from translation of these sites on the heterologous AT amino acid sequence. This 
necessitated nucleotide changes, shown in bold in FIG. 8, at the beginning and near the end of 
the LigAT2-encoding DNA sequence. (In FIG. 8, the underlined nucleotides are the wild-type 
1 5 sequence.) In addition, two other restriction sites, EcoRl and 5amHI, were also introduced at 
the 5' ends of the N-terminal and C-terminal oligonucleotides, respectively, for convenient 
subcloning of the PCR-generated product. The approximately 1 kb LigAT2 domain was 
amplified from Cosmid 58 as follows: The 1 00 \iL PGR reaction mixture contained 10 ^iL of 
lOx PGR buffer (Bethesda Research Laboratories), 2 |iL of 10 mM dNTP mixture, 2-4 ixL of 
20 50 mM MgCl2, 100 pM of each oligo, 10-50 ng of template DNA and water to 100 ^L. 

Cycling conditions were as follows: One cycle at 96'*C/6 min, 80°C/1 min (add 5 U Taq 
DNA Polymerase during this 1 min) and 72°C/2 min; 30 cycles at 95'*C/1 min, 65**C/1 min 
and 72**C/2 min with a 5 min extension at 72'*C for the last cycle. The entire reaction was 
then run on a 1% agarose gel and the desired fragment was isolated with Prep-A-Gene 
25 (BioRad, Hercules, CA). The PGR product was digested with EcoRl and BamHl and 

subcloned into the EcoKl and BamHl sites of pUCl 8. The ligation mixture was transformed 
into E, coli DH5a (GIBCO BRL) according to the manufacturer's instructions and 
transformants were selected on LB plates containing 150 |ig/mL ampicillin and 50 \xL of a 
2% solution of X-gal for blue/white selection. Clones were confirmed by restriction analysis 
30 and the fidelity of the insert was confirmed by DNA sequencing. The final plasmid construct 
was named pUCl 8/LigAT2. 

EXAMPLE 3: Construction of plasmid pErvATl/LigAT2 
pEryATl/LigAT2 was constructed using standard methods of recombinant DNA 
35 technology according to the schematic outlines of FIGS. 9 and 1 0. To construct a gene- 
replacement vector specific for the eryATl domain, the two DNA regions immediately 
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adjacent to eryATl -encoding DNA were cloned and positioned adjacent to the LigAT2- 
encoding DNA as described in EXAMPLE 2. The 5' and 3* boundaries of eryATl were 
designated as 3825 and 4866, and correspond to the deposited sequence (GenBank 
accession number M63676). To subclone the DNA fragment upstream of the eryATl domain 
5 encoding region from the Sac, erythraea chromosome, two PGR oligonucleotides (SEQ ID 
NO:7 and SEQ ID NO:8) were designed so that an Ecd^ site was added at the 5' end of the 
region and AvrW-BamYH restriction sites were introduced at the 3* end. The 5'-flanking region 
(about 1 kb) was PGR generated as described in EXAMPLE 2 using plasmid pAIEN22 DNA 
as template. (This plasmid is a pUC19 derivative containing 22 kb of Sac, erythraea DNA 

10 from an EcdRii site upstream of eryAI to an Nhel site in eryAII cloned into EcoRl and Xbal 

cut pUC19). The PGR product was subcloned into EcoRI and BamHl sites of pUG19 and the 
ligated DNA transformed into E. coli DH5a (GIBGO BRL) according to the manufacturer's 
instructions. Clones were selected on LB plates containing 150 |ig/mL ampicillin and 50 |iL 
of a 2% solution of X-gal for blue/white selection. Glones were confirmed by restriction 

1 5 analysis and the fidelity of the insert was confirmed by DNA sequencing. The resulting 
construct was named pUG19/AT 1/5 -flank. 

For subcloning the 3 -flanking region of the eryATl from Sac. erythraea 
chromosome, two PGR oligonucleotides (SEQ ID NO:9 and SEQ ID NO: 10) were designed 
so that BamHl-Nsil restriction sites were introduced into the 5' end of the region and a 

20 HindlU restriction site was added to the 3' end. The 3 -flanking region (about 1 kb) was also 
generated by PGR using pAIEN22 as template as described above. The PGR fragment was 
subcloned into the BamUl and Hindlll sites of pUG19 and the ligated DNA transformed into 
E, coli DH5a as above. Glones were selected on LB plates containing 150 jig/mL ampicillin 
and 50 \xL of a 2% solution of X-gal for blue/white selection. Glones were confirmed by 

25 restriction analysis and the fidelity of the insert was confirmed by DNA sequencing. This 
intermediate construct was named pUG19/ATl/3 -flank. The two flanking regions were 
joined by first isolating the 1 kb BamUl-Hmdlll fragment (3'-flank) from pUG19/ATl/3*- 
flank and then ligating this fragment to pUG19/AT 1/5 -flank cut with BamUl and Hindlll. 
Ligated DNA was transformed into E, coli DH5a and clones isolated as described. The 

30 resulting plasmid was named pUG 1 9/ATl -flank. The 2. 1 kb £coRI and Hindlll fragment 

from pUG19/ATl -flank was then isolated and ligated to pGS5 cut with the same enzymes to 
generate pGS5/ATl -flank. The final step in the construction of pEryATl/LigAT2 was to 
ligate the 1 kb LigAT2 fragment having Avrll and Nsil ends to pGS5/ATl -flank cut with the 
same enzymes to give the gene replacement/integration plasmid pEryATl/LigAT2, All 

35 ligation mixtures were transformed into the intermediate host E, coli DH5a and clones 
selected as previously described. 
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mobile phase. Fractions were analyzed by bioassay and ^ H NMR. Two macrolide containing 
peaks of bioactivity were observed 2ind the later eluting species was readily characterized by 
its and ^^C NMR spectra as 12-desmethyl-12-deoxy erythromycin A. Parameters from 
the NMR spectra are listed in Table 2. The assignments were made with the aid of 

5 correlational spectroscopy (COSY), heteronuclear multiple quantum correlation (HMQC), 
heteronuclear multiple bond correlation (HMBC), and distortionless enhancement by 
polarization transfer (DEPT) experiments. Mass spectral data of this sample was also 
consistent with the structural assignment. Electrospray ioni2:ation (ESI) of this sample 
revealed an M+H"*" ion at M/Z 704, which is in full accord with erythromycin A lacking both 

1 0 a methyl group and a hy droxy 1 group. 



Table 2 

NMR chemical shift ( ) assignments for 12-desmethyl-12-deoxyerythromycin A 

in CDCI3 

15 



2-H 


2.74 


I'-H 


4.47 


3-H 


4.15 


2'-H 


3.25 


4-H 


2.01 


3'-H 


2.49 


5-H 


3.58 


4'-Ha 


1.67 


7-Ha 


1.91 


4'-Hb 


1.23 


7-Hb 


1.66 


5'-H 


3.54 


8-H 


2.86 


6'-H3 


1.23 


10-H 


2.70 


N(CH3)2 


2.30 


11-H 


4.05 


1"-H 


4.85 


12-Ha 


1.71 


2"-Ha 


2.40 


12-Hb 


1.46 


2"-Hb 


1.59 


13-H 


5.06 


4"-H 


3.03 


14-H2 


1.59 


5"-H 


4.04 


I5-H3 


0.89 


6"-H3 


1.30 


2-CH3 1.19 




3"-CH3 1.25 




4-CH3 1.13 




OCH3 3.33 




6-CH3 1.38 








8-CH3 1.19 








IO-CH3 


1.11 







35 

EXAMPLE 6: Construction of plasmid pErvAT2/LigAT2 
pEryAT2/LigAT2 was constructed using standard methods of recombinant DNA 
technology. To make a gene-replacement vector specific for the eryAT2 domain, two DNA 
regions flanking eryAT2 were cloned and positioned adjacent to the DNA encoding the 
40 domain to be inserted in order to effect homologous recombination. Boundaries of the AT2 
domeiin were chosen as described in EXAMPLE 2. The 5* and 3' boundaries of eryAT2 are 
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anisaldehyde reagent. The region of the novel spot was instead scraped from the TLC plate 
and the silica resin re-extracted with ethyl acetate-methanol (1:1) and then twice with ethyl 
acetate. The combined solvent phases were then dried in a Speed Vac. Mass spectrometric 
analysis revealed the novel compound to have a mass of 704, which corresponds to the 
5 molecular ion plus a proton (M+H"*") of 1 2-desmethyl-l 2-deoxy erythromycin A. 

To acquire milligram quantities of highly purified material for performance of NMR 
analysis, the culture was grown in a 42-liter LH Fermentation Series 2000 fermentor. SCM 
medium was used for growth of inoculum and for the fermentation. Seed for the 
fermentation was grown in two steps. In the first step, frozen vegetative inoculum was used 
10 to seed 100 mL of SCM medium in a 500 mL Erienmeyer flask. For the second step, 2-liter 
Erlenmeyer flasks containing 600 mL of SCM medium were seeded at 5% from the first 
passage growth. Each step was incubated for 3 days at 32 ^C on a rotary shaker operated at 
225 rpm. 

Thirty liters of SCM medium were prepared in the 42-liter fermentor and sterilized at 

15 12PC and 15 psi for 1 hour. Antifoam (XFO-371, Ivanhoe Chemical Co., Mundelein, IL) 
was added initially at 0.01% and then was available on demand. The fermentor was 
inoculated with 1 .5 liters of the second passage seed growth. The temperature was controlled 
at 32°C. The agitation rate was 260 rpm and the air flow was 1.3 vol/vol/min. The head 
pressure was maintained at 6 psi. During fermentation pH was controlled at 7.3 with 5 M 

20 propionic acid. The fermentation was terminated at 1 1 1 hours, and the fermentation beer was 
adjusted to pH 8. This was followed by two extractions with equal volumes of CH2CI2. The 
pooled CH2CI2 extract was then concentrated to approximately 400 mL and extracted twice 
with equal volumes of 0.05 M aqueous potassium phosphate pH 5.5. The aqueous phase was 
pooled and adjusted to pH 8, and then extracted twice with equal volumes of ethyl acetate. 

25 The ethyl acetate extracts were pooled and concentrated to yield 5 ml oil. The extraction 

sequence described above was then repeated to yield 600 mg of oil after concentration. Next, 
the sample was split and each half was digested in 2.5 ml each of the upper and lower phases 
of a solvent system consisting of n-hexane, ethyl acetate, 0.02 M aqueous potassium 
phosphate, pH 8, (1 : 1 : 1 , v/v/v). These were then chromatographed on the Coil Planet 

30 Centrifuge using the upper phase as the mobile phase. Fractions were analyzed by bioassay 
against Staphylococcus aureus and NMR. Two macrolide containing peaks of bioactivity 
were observed in both samples, and the later eluting peaks from each sample, which 
contained most of the bioactivity, were pooled and concentrated. The concentrated material 
was then digested in 2.5 mL each of the upper and lower phases of a solvent system 

35 consisting of n-hexane, ethyl acetate, 0,02 M aqueous potassium phosphate, pH 6.5, (6:4:5, 
v/v/v), and was chromatographed on the Coil Planet Centrifuge using the upper phase as the 
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EXAMPLE 4: Construction of Sac, erv/femgg ER720 ErvATl/LigAT2 
An example of a 12-desmethyl-12-deoxyerythromycin A producing microorganism 
was prepared by replacing the DNA fragment encoding the methylmalonyl acyltransferase 

5 domain in module 1 of the erythromycin PKS (EryATl) of Sac, erythraea ER720 with a 

newly discovered DNA fragment encoding a malonyl acyltransferase domain (LigAT2) from 
S, hygroscopicus ATCC 29253. This was accomplished with the recombinant plasmid, 
pEryATl/LigAT2, prepared as described in EXAMPLE 3. Transformation of Sac, erythraea 
ER720 and resolulion of the integration event were carried out according to the following 

10 method. Sac. erythraea ER720 cells were grown in 50 mL of SGGP medium for 3 days at 
32^*0 and then washed in 10 mL of 10,3% sucrose. The cells were resuspended in 10 mL of 
PM buffer containing 1 mg/mL lysozyme and incubated at SO^C for 15-30 minutes until most 

of the mycelial segments were converted into spherical protoplasts. The protoplasts were 
washed once with Pm and then resuspended in 3 mL of the same buffer containing 10% 

1 5 DMSO for storage in 200 ^L aliquots at -80°C. 

Transformation was accomplished by quickly thavvring an aliquot of protoplasts, 
centrifuging for 15 seconds in a microfuge, decanting the supernatant, and resuspending the 
protoplasts in the Pm remaining in the tube. Ten |iL of DNA solution was added (3 p.L of 
pEryATl/LigAT2 DNA from EXAMPLE 3 at about 1 \x^\xL in 7 ^L of Pm buffer) and 

20 mixed with the protoplasts by gently tapping the tube. Two tenths of a mL of 25% PEG 8000 
in T buffer (Hopwood, et al., 1985, Genetic Manipulation of Streptomyces A Laboratory 
Manual, The John Innes Institute) was then added, mixed by pipetting the solution 3 times 
and the suspension immediately spread on a dried R3M plate. The plate was incubated at 
30°C for 20 hours and overlaid with 2 mL of water containing 100 jig/mL thiostrepton, dried 

25 briefly and incubated 4 more days at 30*'C. 

To select stable transformants (integrants) colonies arising on the transformation 
plates were re-streaked onto R3M plates containing thiostrepton (20 |xg/mL). Two colonies 
were confirmed to be thiostrepton resistant and one of these was inoculated into SGGP 
containing thiostrepton (10 |ig/mL) to isolate chromosomal DNA for Southern analysis. 

30 Integration of the plasmid DNA into the ER720 chromosome was further confirmed by 

Southem hybridization (data not shown). Hybridization was at 65**C and the stringency wash 
was with 0. 1 x SSC at 65^C. 

The confirmed integrant was grown in SGGP without antibiotic for four days and then 
plated onto non-selective R3M plates for sporulation. Spores were plated on R3M plates to 

35 obtain individual colonies, which were then screened for sensitivity to thiostrepton, indicating 
loss of the plasmid sequence firom the chromosome. Five thiostrepton sensitive colonies were 
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selected and their chromosomal DNA digested with Sph\ and analyzed by Southern 
hybridization. Hybridization was at eS'^C and the stringency wash was with O.lx SSC at 
65°C. In three of the five thiostrepton sensitive colonies, a probe consisting of an 
approximately 3 kb £coRI////>7dIII fragment from pEryATl/LigAT2 hybridized with 
5 fragments of approximately 3.5 and 1 .6 kb, indicating that LigAT2 had replaced EryATl in 
the chromosomes of these resolvants. The strain was named Sac, erythraea ER720 
EryATl/LigAT2. 

EXAMPLES: Analysis of compounds produced bv S^c. ervthr aea ER720 ErvATl/LigAT2 

1 0 Compounds produced by the recombinant Sac, erythraea strain, ER720 

EryATl/LigAT2, whose construction is described in EXAMPLE 4, were characterized by 
TLC, bioautography, mass spectrometry and NMR analysis. 

For TLC analysis cells were grown in either SGGP or SCM medium for 4-5 days at 
SO^'C. An aliquot of culture (1 .5 mL) was centrifuged for 1 minute in a microfuge to remove 

1 5 cells. One mL of the resulting supernatant was removed to another microfuge tube and the 
pH adjusted to 9.0 by the addition of 6 ^L of NH4OH. Then 0.5 mL of ethyl acetate was 
added, the tube was vortexed for 10 sec and then centrifuged for approximately 5 min to 
achieve phase separation. The organic phase was removed to another tube, and the aqueous 
phase was re-extracted with 0.5 mL of ethyl acetate. The second organic phase was 

20 combined with the first and dried in a Speed Vac. The residue was taken up in 10 |aL of ethyl 
acetate and 5 ^iL was spotted onto a Merck 60F-254 silica gel TLC plate. The plate was run 
in isopropyl ether:methanol:NH40H (75:35:2). Erythromycin derivatives were visualized by 
spraying the plates with anisaldehyde:sulfuric acid;ethanol (1:1:9). Using this reagent, a 
novel compound predicted to be 12-desmethyl-12-deoxy erythromycin A, appeared as a blue 

25 spot running slightly faster than erythromycin A. 

To detect biological activity, a TLC-bioautography assay was performed. In this 
assay, one microliter of the extracted sample firom above was spotted onto a TLC plate which 
was run as described above. The plate was then air-dried and placed in a sterile bio-assay 
dish (245x245x25 nrni). The plate was then covered with 100 mL of antibiotic medium 1 1 

30 (DIFCO-BACTO) containing Staphylococcus aureus as an indicator strain and incubated 

overnight at 37''C. As with the positive controls, a clear zone of inhibition developed around 
the sample spot indicating that the novel compound had bioactivity. 

To determine whether the novel spot seen on TLC had the molecular mass 
corresponding to the predicted 12-desmethyl-12-deoxy erythromycin A, an ethyl acetate 
35 extract was further analyzed by mass spectrometry. The mass spectrometry samples were 
isolated by TLC basically as described above except that plates were not sprayed with the 
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designated as 8255 and 9282, respectively, and correspond to deposited eryy4/ sequence 
(GenBank accession number M63676). To subclone the DNA fragment upstream of the 
eryAT2 DNA, two PGR oligonucleotides (SEQ ID N0:1 1 and SEQ ID N0:12) were 
designed so that a Hindlll site was added at the 5' end of the region and Avrll-Pstl restriction 

5 sites were introduced at the 3' end. For subcloning the 3'-flanking region of eryAT2, two 

PGR oligonucleotides (SEQ ID NO: 13 and SEQ ID NO: 14) were designed so that Pstl-Nsil 
restriction sites were introduced at the 5' end of the region and an EcoU site at the 3* end. 
Both the 5 -flanking and 3'.flanking regions (about 1 kb each) were PGR generated as 
described in EXAMPLE 3. In the case of the 5'-flanking region, the PGR product was 

10 subsequently subcloned into Hindlll and Pstl sites of pUCl 8 whereas the PGR product of the 
3*-flanking region was subcloned into the Pstl and EcoRl sites of pUCl 8. Ligations, 
transformations and confirmations of selected clones were performed as in EXAMPLE 3. 
The resulting construct containing the AT2 5 -flanking region was designated pUC18/AT2/5*- 
flank and the construction containing the AT2 3*-flanking region was designated 

1 5 pUG 1 8/AT2/3'-flank. The two flanking regions were then joined by first isolating the 1 kb 
Pstl and Ecom fragment (3'-flank) from pUG18/AT2/3'-flank, and ligating this fragment to 
pUG 1 8/AT2/5'-flank cut with Pstl and EcoRl, The ligation was transformed into £ coli 
DHSa and clones isolated as described. The resulting plasmid was named pUG18/AT2- 
flank (FIG. 1 1). The 2.2 kb £coRI and Hindlll fragment from pUG18/AT2-flank was then 

20 isolated and ligated to pGS5 cut with the same enzymes to generate pGS5/AT2-flank. The 
final step in the construction of pEryAT2/LigAT2 was to ligate the LigAT2 encoding DNA 
fragment from pUC18/LigAT2 having y4vrll and Nsil ends (described in EXAMPLE 2) to 
pCS5/AT2-flank cut with the same enzymes to give the gene replacement, integration 
plasmid pEryAT2/LigAT2 (FIG. 12). All ligations were transformed into the intermediate 

25 host E. coli DH5a and clones selected as previously described. 

EXAMPLE?: Constniction of Sac, ervthraea ER720 ErvAT2/LigAT2 
An example of a 10-desmethy Erythromycin A and lO-desmethyl-12- 
deoxyerythromycin A producing microorganism was prepared by replacing the 

30 methylmalonyl acyltransferase domain of module 2 of the erythromycin PKS (EryAT2) of 
Sac, erythraea ER720 with a newly discovered malonyl acyltransferase domain (LigAT2) 
fi-om 5. hygroscopicus ATGG 29253. This was accomplished with the recombinant plasmid, 
pEryAT2/LigAT2, prepared as described in EXAMPLE 6. Transformation of ER720 and 
selection and confirmation of stable resolvants were carried out essentially as described in 

35 EXAMPLE 4. Two thiostrepton sensitive colonies were selected and their chromosomal 

DNA cut with Sphl and analyzed by Southern hybridization. In one of the two thiostrepton 
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sensitive colonies, a probe consisting of an approximately 1 kb LigAT2 sequence hybridized 
with a chromosomal DNA fragment of approximately 900 bp, indicating that LigAT2 had 
replaced EryAT2 in the chromosome of this resolvant. The strain was named Sac. erythraea 
ER720 EryAT2/LigAT2, 

5 

EXAMPLE 8: Analysis of compounds produced by 
Sac, ervthraea ER720 ErvAT2/LigAT2 
Compounds produced by the recombinant Sac. erythraea strain, ER720 
EryAT2/LigAT2, whose construction is described in EXAMPLE 7, were characterized by 
10 TLC, bioautography and mass spectrometry. 

For small scale analysis, the cells were grown in either SGGP or SCM medium for 4- 
5 days at 30°C. The culture was processed for TLC analysis essentially as described in 
EXAMPLE 5. Two novel compounds predicted to be 10-desmethylerythromycin A and 10- 
desmethyl-12-deoxy erythromycin A, appeared as blue spots with the lower spot running 
1 5 slightly slower than erythromycin A and upper spot running slightly faster than erythromycin 
A. 

To detect biological activity, a TLC-bioautography assay was performed essentially as 
described in EXAMPLE 5. In this assay, 0.2 to 1 microliter of the extracted sample from 
above was spotted onto a TLC plate which was run as described. The plate was then air-dried 

20 and placed in a sterile bio-assay dish (245x245x25 nam). The plate was then covered v^th 
100 mL of antibiotic medium 1 1 (DIFCO-BACTO) containing Staphylococcus aureus as an 
indicator strain. The inhibition zones were developed by overnight incubation of the plate at 
37 oC. As with the positive controls, a zone of inhibition developed around the two novel 
spots (compounds) indicating that each have bioactivity against Staphylococcus aureus. 

25 To determine whether the novel spots seen on TLC had the molecular masses 

corresponding to the predicted 10-desmethylerythromycin A and lO-desmethyl-12- 
deoxyerj^thromycin A, an ethyl acetate extract was further analyzed by mass spectrometry. 
The mass spectrometry samples were isolated by TLC similarly to the method described 
above except that plates were not sprayed with the anisaldehyde reagent. Instead, two regions 

30 which contain the novel spots were scraped from the TLC plate and the silica resin re- 
extracted with ethyl acetate-methanol (1:1) and then twice with ethyl acetate. The combined 
solvent phases were then dried in a Speed Vac. In addition to the samples described above, a 
crude ethyl acetate extract was also analyzed by LC-MS, in which the sample components 
were first separated by liquid chromatography and then analyzed by mass spectrometry. 

35 Mass spectrometric analysis revealed the two novel compounds to have masses of 720 and 
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704, which correspond to the molecular ion plus a proton (M+H"*") of 10- 
desmethylerythromycin A and lO-desmethyl-12-deoxy erythromycin A, respectively. 

EXAMPLE 9: Cloning of the venAT Domain from Streptomvces venezuelae 
5 A genomic library of Streptomyces venezuelae ATCC 15439 DNA was constructed 

in the bifimctional cosmid pNJl (Tuan, et al. Gene 90: 21-29 (1990)) using standard 
methods of recombinant DNA technology. A cosmid from this library, pVenl7, was 
characterized by Southern analysis and Sstl fragments of approximately 3.5, 3.8, and 4.0 kb 
were found to hybridize to a 1 .37 kb Smal fragment that encompasses the ketosynthase (KS) 

10 domain from module 2 of the erythromycin PKS gene eryAI (Donadio et al^ Science 252: 

675-679 (1991)). The 4.0 kb Sstl fragment was then subcloned into pUC19 to give pVen4.0. 
The nucleotide sequence of pVen4.0 insert DNA was determined from single strand DNA 
templates prepared from M13mpl8 and M13mpl9 (Yanisch-Perron, et al. Gene ,33:103 
(1985)) subclones using Sequenase version 2.0 with 7-deaza-dGTP (United States 

1 5 Biochemical, Cleveland, OH) and 5'-[a-32p] or 5*-[a-33p]-dCTP (NEN Research Products, 
Boston, MA). Because pVen4.0 did not contain the entire AT domain, the nucleotide 
sequence was extended using pVenl7 DNA as the template. The nucleotide sequence of the 
venAT domain (SEQ ID NO:2) and its corresponding amino acid sequence (SEQ ID NO:32) 
is shovm in FIG. 13 (top and bottom strands respectively). 

20 

EXAMPLE 10: Construction of plasmid pErvATl /venAT 
pEryATl /venAT was constructed using standard methods of recombinant DNA 
technology according to the schematic outlines of FIGS. 14 and 15. Two PGR 
oligonucleotides (SEQ ID NO: 1 5 and SEQ ID NO: 1 6) were designed to subclone the 1 .03 kb 

25 DNA fragment that encodes the venAT domain (FIG. 14) from the S. venezuelae PKS cluster 
and to introduce two unique restriction sites, AvrW and Nsil, for cassette cloning (described in 
EXAMPLE 2), This necessitated nucleotide changes (shown in bold in FIG. 14) at the 
beginning and near the end of the venAT sequence (underlined nucleotides are the wild-type 
sequence). In addition, two other restriction sites, EcoRl and BamHl, were also introduced at 

30 the 5' ends of the N-terminal and C-terminal oligonucleotides, respectively, for convenient 
subcloning of the PCR-generated product. The approximately 1 kb venAT-encoding DNA 
was PGR amplified from cosmid pVenl7 template DNA (EXAMPLE 2) using VentR® DNA 

Polymerase (New England Biolabs). A typical PGR reaction contained 10 i^L ThermoPol 
Buffer, 10 piL formamide, 10 of 20% glycerol, 55 |xL water, 100 pmole of each primer, 
35 and approximately 0.2 jxg DNA. The sample was heated to 99**G for 2 minutes, and then 

allowed to cool to 80°G for 2 minutes, at which time 16 jxL of a 1.25 mM mixture of dATP, 
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dCTP, dGTP, and dTTP and 2 units of Vent DNA polymerase were added. A temperature 
cycle of 35 seconds at 96.5°C and 2 minutes 15 seconds at 72'*C was then repeated 30 times, 
followed by a 3 minute incubation at 72°C. The desired PGR fragment was then isolated 
from low melting agarose by standard procedures. The PGR product was ligated to Hindi 
5 digested pUGl 8 and transformed into E. coli DH5a (GIBGO BRL) according to the 

manufacturer's instructions. Glones were selected on LB plates containing 150 ^g/mL 
ampicillin and 50 jiL of a 2% solution of X-gal for blue/white selection. Clones were 
confirmed by restriction analysis and the fidelity of the insert was confirmed by DNA 
sequencing. The final construct was named pUClS/venAT. 
10 The final step in the construction of pEryATl/venAT was to ligate the 1 kb venAT 

fragment having y4vrll and Nsil ends to pCS5/ATl -flank (EXAMPLE 3) cut with the same 
enzymes to give the gene replacement/integration plasmid pEryATl/venAT (FIG. 15). All 
ligations were tremsformed into the intermediate host E. coli DH5a and clones selected as 
previously described. 

15 

EXAMPLE 1 1 : Construction of ^^c. ervthraea ER720 ErvATl/venAT 
A 12-desmethyl-12-deoxy erythromycin A producing microorganism was prepared by 
replacing the methylmalonyl acyltransferase domain of module 1 of the erythromycin PKS 
(EryATl) of Sac. erythraea ER720 with a newly discovered malonyl acyltransferase domain 

20 (venAT) from S. venezuelae ATGG 15439 This was accomplished with the recombinant 
plasmid, pEryATl /venAT, prepared as in EXAMPLE 10. Transformation of ER720 and 
selection and confirmation of stable resolvants were carried out essentially as described in 
EXAMPLE 4. Four thiostrepton sensitive colonies were selected and their chromosomal 
DNA cut with Pvull and analyzed by Southern hybridization. In two of the four thiostrepton 

25 sensitive colonies, a probe of venAT sequence hybridized with chromosomal DNA fragments 
of approximately 4.2 and 2.4 kb, indicating that venAT had replaced EryATl in the 
chromosomes of these resolvants. The strain was named Sac. erythraea ER720 
EryATl /venAT. 

30 EXAMPLE 12: Analysis of compounds produced by 

Sac, ervthraea ER720 Erv ATI /venAT 
Compounds produced by the recombinant Sac, erythraea strain, ER720 
EryATl /venAT, whose construction is described in EXAMPLE 1 1, were characterized by 
TLG, bioautography, and mass spectrometry. 
35 For TLG analysis cells were grown in either SGGP or SGM medium for 4-5 days at 

30'*C. The culture was processed for TLG essentially as described in EXAMPLE 5. A novel 
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compound predicted to be 12-desmethyl-12-deoxy erythromycin A, appeared as a blue spot 
running slightly faster than erythromycin A. 

To detect biological activity, a TLC-bioautography assay was performed essentially as 
described in EXAMPLE 5. As with the positive controls, a clear zone of inhibition developed 
around the sample spot indicating that the novel compound was bioactive. 

To determine whether the novel spot seen on TLC had the molecular mass 
corresponding to the predicted 12-desmethyl-12-deoxy erythromycin A, an ethyl acetate 
extract was further analyzed by mass spectrometry. The mass spec samples were isolated by 
TLC basically as described above except that plates were not sprayed with einisaldehyde. The 
region of the novel spot was instead scraped from the TLC plate and the silica resin re- 
extracted with ethyl acetate-methanol (2:1) and then twice with ethyl acetate. The combined 
solvent phases were then dried in a Speed Vac. Mass spectrometric analysis revealed the 
novel compound to have a mass of 704, which corresponds to the molecular ion plus a proton 
(M+H"*") of 12-desmethyl-12-deoxy erythromycin A. 

EXAMPLE 13: Construction of olasmid pUClQ/rapATH 
Two PGR oligonucleotides (SEQ ID NO: 1 7 and SEQ ID NO: 1 8) were designed to 
subclone the 1023 bp rapAT14-encoding DNA fragment from the rapamycin biosynthetic 
gene cluster (GenBank Accession #: X86780) and to introduce two unique restriction sites, 
>4vrII and Nsil^ for cassette cloning (described in EXAMPLE 2). This necessitated nucleotide 
changes (shown in bold in FIG. 16) at the beginning and near the end of the rapAT14 
sequence. (In FIG. 16, the underlined nucleotides are the wild-type sequence.) In addition, 
two other restriction sites, EcoKL md //mdlll, were also introduced at the 5' ends of the N- 
terminal and C-terminal oligonucleotides, respectively, for convenient subcloning of the 
PCR-generated product. The approximately 1 kb rapAT14-encoding DNA was amplified by 
PCR using chromosomal DNA from Streptomyces hygroscopicus ATCC 29253 as template. 
The PCR conditions were as follows: The 100 \xL reaction mixture contains 10 |iL of lOx 
Thermopol Buffer (New England Biolabs), 2% glycerol, 10% formamide, 100 pmoles of each 
oHgo, 100-200 ng of template DNA and water to 84 \iL. The sample was then heated to 99''C 
for two minutes followed by cooling to SO^'C for two minutes at which time 16 |iL of a dNTP 
solution (1.25 mM dATP and dTTP, 1.5 mM dCTP and dGTP) and 1 nL of VentR® DNA 

Polymerase (New England Biolabs) was added. Cycling was as follows: 30 cycles at 
96.5°C/35 sec, 65**C/1 min and 72**C/1 .5 min followed by one cycle at 72**C for 3 min. The 
entire reaction was then run on a 1 .2% low-melting agarose gel and the desired fragment was 
isolated by melting the appropriate gel slice at 65**C, adding 3 volumes of TE buffer, 
extracting 2X with phenol and once with chloroform, and ethanol precipitating the aqueous 
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phase. The isolated DNA was ligated directly into HincU digested pUC19. The ligation 
mixture was transformed into E. coli DH5a (GIBCO BRL) according to the manufacturer's 
instructions and transformants were selected on LB plates containing 150 jig/mL ampiciilin 
and 50 jiL of a 2% solution of X-gal for blue/white selection. Clones were confirmed by 
5 restriction analysis and the fidelity of the insert was confirmed by DNA sequencing. The 
final plasmid construct was named pUC19/rapAT14. 

EXAMPLE 14: Construction of plasmid pErvATl /rap AT 14 
pEryATl/rapAT14 was constructed using standard methods of recombinant DNA 

10 technology according to the schematic outlines of FIGS. 16 and 17. To make a gene- 
replacement-vector specific for the eryATl domain, the two DNA regions immediately 
adjacent to eryATl were cloned and positioned adjacent to the DNA encoding the rapAT14 
domain in order to allow homologous recombination to occur. The strategy and protocol for 
constructing the intermediate plasmid containing the flanking regions, pCS5/ATl -flank, are 

1 5 described in EXAMPLE 3 and FIG. 9. To insert the rapAT14 fragment between the flanking 
regions, pUC19/rapAT14 (from EXAMPLE 13) was digested with Nsil and Avrll and the 
resulting 1 kb fragment was isolated from a 0.8% agarose gel with Prep-A-Gene. pCS5/ATl- 
flank was also digested with these enzymes and the linearized plasmid was isolated from 
0.8% agarose gel. The two fi-agments were ligated, transformed into the intermediate host E. 

20 colt DH5a and ampiciilin resistant clones were selected as previously described. Insertion of 
the rapAT14 fragment between the ery flanking regions was confirmed by restriction analysis 
and the resulting plasmid was called pEryATl/rapAT14. 

EXAMPLE 15: Construction of 5:qc. erv//ir^ga ER720 ErvATl/raoATH 
25 An example of a 12-desmethyH2-deoxy erythromycin A producing microorganism 

was prepared by replacing the methylmalonyl acyltransferase domain of module 1 of the 
erythromycin PKS (EryATl) of Sac, erythraea ER720 with the acyltransferase domain from 
module 14 of the rapamycin PKS from S. hygroscopicus ATCC 29253. This was 
accomplished with the recombinant plasmid, pEryATl/rapAT14, prepared as described in 
30 EXAMPLE 14. Transformation of Sac. erythraea ER720 and selection and confirmation of 
stable resolvants were carried out essentially as described in EXAMPLE 4. Six thiostrepton 
sensitive colonies were selected and their chromosomal DNA cut with Sty\ and analyzed by 
Southern hybridization. In one of the six thiostrepton sensitive colonies, a probe consisting 
of an EcoR'Hindlll fragment from pCS5 ATI -flank hybridized with a chromosomal DNA 
35 fragment of approximately 1 .6 kb, indicating that rapAT14 had replaced EryATl in the 
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chromosome of this resolvant. The strain was named Sac. erythraea ER720 
EryATl/rapAT14. 

EXAMPLE 1 6: Analysis of compounds produced by 
5 Sac, erythraea ER720 EryATl/rapAT14 

Compounds produced by the recombinant Sac. erythraea strain, ER720 
EryATl/rapATM, whose construction is described in EXAMPLE 15, were characterized by 
TLC and mass spectrometry. For TLC analysis strain 4-A-l was grown in SCM medium for 
4 days at 30*'C. The culture was processed for TLC essentially as described in EXAMPLE 5. 
10 A novel compound predicted to be 12-desmethyl-12-deoxyerythromycin A, appeared as a 
blue spot running slightly faster than erythromycin A. 

To determine whether the novel spot seen on TLC has the molecular mass 
corresponding to the predicted 12-desmethyl-12-deoxyerythromycin A, an ethyl acetate 
extract was further analyzed by Mass Spectrometry. Sac. erythraea ER720 EryATl/rapAT14 
1 5 was grown in SCM medium for 4 days. Ten mL of culture was centrifiiged to remove 

mycelia and pH of the supematant was adjusted to 9 with NH4OH. The supernatant was then 

extracted twice with ethyl acetate and the organic phases pooled and dried. Mass 
spectrometric analysis of this crude ethyl acetate extract shows the mass of the novel spot to 
be 704, which corresponds to the molecular ion plus a proton (M+H"*") of 12-desmethyl-12- 
20 deoxyerythromycin A. 

EXAMPLE 17: Construction of plasmid DEryAT2/rap AT 14 
pEryAT2/rapAT14 w£is constructed using standard methods of recombinant DNA 
technology according to the schematic outlines of FIGS. 16 and 18. To make a gene- 

25 replacement-vector specific for the ery AT2 domain, the two DNA regions immediately 

adjacent to ery AT2 were cloned and positioned adjacent to the DNA encoding the rapAT14 
domain in order to allow homologous recombination to occur. The strategy and protocol for 
constructing the intermediate plasmid containing the flanking regions, pCS5/AT2-flank, are 
described in EXAMPLE 6 and FIG. 12. The final step in the construction of 

30 pEryAT2/rapAT14 was to ligate the 1 kb rapAT14-encoding DNA fragment having AvrW and 
Nsil ends to pCS5/AT2-flank (EXAMPLE 6) cut with the same enzymes to give the gene 
replacement/integration plasmid pEryAT2/rapAT14 (FIG. 18). All ligations were 
transformed into the intermediate host E. coli DH5a and clones selected as previously 
described. 

35 

EXAMPLE 18: Construction of iSac. ervZ/irggg ER720 ErvAT2/rapAT14 
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A lO-desmethylerythromycin A and lO-desinethyl-12-deoxyerythromycin A 
producing microorganism was prepared by replacing the DNA fragment encoding the 
methylmalonyl acyltransferase domedn of module 2 of the erythromycin PKS (Ery AT2) of 
Sac. erythraea ER720 with a DNA fragment encoding a malonyl acyltransferase domain 
5 (rapATH) from S. hygroscopicus ATCC 29253 This was accomplished with the 
recombinant plasmid, pEryAT2/rapAT14, prepared as described in EXAMPLE 17. 
Transformation of ER720 and selection and confirmation of stable resolvants were carried 
out essentially as described in EXAMPLE 4. Four thiostrepton sensitive colonies were 
selected and their chromosomal DNA cut with ^^-pEI and analyzed by Southem 
10 hybridization. In three of the four thiostrepton sensitive colonies, a probe consisting of a 

fragment of 5 '-flanking region of eryAT2 hybridized with a chromosomal DNA fragment of 
approximately 4.3 kb, indicating that rap AT 14 had replaced EryAT2 in the chromosomes of 
these resolvants. The strain was named Sac. erythraea ER720 EryAT2/rapAT14. 

15 EXAMPLE 19: Analysis of compounds produced by 

Sac, erythraea ER720 ErvAT2/rapAT14 
Compounds produced by the recombinant Sac. erythraea strain, ER720 
EryAT2/rapAT14, whose construction is described in EXAMPLE 1 8, were characterized by 
TLC, bioassay, and mass spectrometry. 

20 For TLC analysis cells were grown in either SGGP or SCM medium for 4-5 days at 

BO^'C. The culture was processed for TLC essentially as described in EXAMPLE 5. Two 
novel compounds predicted to be lO-desmethylerythromycin A and lO-desmethyl-12- 
deoxyerythrpmycin A, appeared as blue spots with the lower spot running slightly slower 
than erythromycin A and upper spot running slightly faster than erythromycin A. 

25 To detect biological activity, a bioassay was performed essentially as described in 

EXAMPLE 5. As with the positive controls, inhibition zones developed around the novel 
compounds indicating that they have bioactivity. 

To determine whether the novel spots seen on TLC have the molecular mass 
corresponding to the predicted 1 0-desmethy lerythromy cin A and lO-desmethyl-12- 

30 deoxyerythromycin A, an ethyl acetate extract from another culture was further analyzed by 
mass spectrometry. The sample was a crude extract of a 20 mL culture grown for 4 days. 
Mass spectrometric analysis revealed the two novel compounds to have masses of 720 and 
704, which correspond to the molecular ion plus a proton (M+H^) of 10- 
desmethylerythromycin A and 1 0-desmethy 1-12-deoxy erythromycin A, respectively. 

35 

EXAMPLE 20: Cloning of the ethvlAT Domain from Streptomvces caelestis 
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A genomic library of Streptomyces caelestis NRRL-2821 (U.S. Patent 3,218,239 
issued November 16, 1965) DNA was constructed in the bifunctional cosmid pNJl (Tuan, et 
al. Gene, 90: 21-29 (1990)). Cosmid vector was prepared by digesting 5 \xg of pNJl with 
£;coRI, dephosphorylating with CIAP and then digesting with Bglll to generate one arm and 
5 also digesting 5 \xg of pNJl with HindWl, dephosphorylating with CIAP and then digesting 
with BgM to generate the other. Insert DNA was prepared by partially digesting 
approximately 5 \ig of chromosomal S. caelestis NRRL-2821 DNA with SaulWA. according 
to the procedure outlined in Maniatis et al, supra. Digestion conditions were chosen which 
produced fragment sizes of approximately 40 kb. The ligation was performed by mixing 
10 approximately 1 \ig of the digested chromosomal DNA with 0.5 ^ig of each cosmid arm. The 
ligation was incubated at 16°C ovemight. Gigapackll XL (Stratagene®) was used for 
packaging 2 |iL of the ligation mix according the manufacturer's instructions. 
Transformation was done in E, coli XL 1 -Blue MR cells (Stratagene®). Individual colonies 
were picked into thirty 96-well plates to give a 99.99% probability that the library represented 
15 all S. caelestis NRRL-2821 genomic sequences. 

The library was screened using a probe specific for the S. caelestis NRRL-2821 PKS 
region. The probe was generated by PGR amplification of S, caelestis NRRL-282 1 genomic 
DNA using degenerate primers designed from consensus ketosynthase (KS) and 
acyltransferase (AT) sequences in the GenBank database. The KS specific oligo (SEQ ID 
20 NO: 1 9) and the AT specific oligo (SEQ ID NO:20) generated a 900 bp PCR fragment. The 
PCR reaction contained 10 jaL ThermoPol Buffer, 2 jaL formamide, 25 ^L of 20% glycerol, 3 
^iL 50 mM MgCl2, 45 }aL water, 50 pmole of each primer, and approximately 0.2 \xg DNA. 
The sample was heated to 99°C for 5 minutes, and then placed on ice, at which time a 1 0 ^L 
cocktail consisting of 2 ^L of a 10 mM mixture of dATP, dCTP, dGTP, and dTTP, 2 units of 
25 Vent DNA polymerase, and 7 jiL of water was added. The sample was then transferred to a 
GeneAmp 9600 thermocycler (Perkin Elmer, Foster City, CA) and a temperature cycle of 1 
minute at 95*'C, 4 minutes at 50°C, and 4 minutes at 72°C was repeated 30 times, followed by 
a 15 minute incubation at 72''C. The desired PCR fragment was then isolated from 1 .0% low 
melting agarose by standard procedures. The KS/AT probe was made by labeling 
30 approximately 50 ng of the PCR fragment with using the Megaprime DNA Labeling 
System (Amersham Life Science, Arlington Heights, XL). Library clones (2,880) were 
transferred from the 96-well plates to Hybond-N nylon filters (Amersham) and screened with 
the KS/AT probe according to procedures in Maniatis, et al.y supra. Hybridization was 
performed at 65**C and the final wash was in O.lx SSC at 65°C. Nineteen of the clones 
35 hybridized strongly with the probe. These clones were then digested with 55/1, run on a 1 .0% 
agarose gel and transferred to Hybond-N nylon filters for Southem analysis using the KS/AT 
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probe. A cosmid named as pCELl 8h5 was chosen for further analysis since it contained the 
largest number of hybridizing restriction fragments. 

The Sstl fragments from cosmid pCELlShS were cloned into pGEM-3Zf (Promega, 
Madison, WI) and sequenced using the fmole DNA Cycle Sequencing System (Promega). 
5 The reactions were run on a Sequi-Gen II Sequencing Apparatus (Bio-Rad, Hercules, CA). 
Individual fragments were oriented relative to one another by sequencing off of cosmid 
pCEL18h5 using primers that hybridized to the 5' and 3' ends of the fragments to generate 
upstream and downstream sequence. These sequences were then matched with sequences 
from the individual fragments to place them in the proper order. A very large Sstl fragment 
10 (>1 0 kb) was further digested with Smal to generate smaller fragments for cloning and 
sequencing. 

By searching the GenBank database with the sequences obtained it was possible to 
identify the various enzymatic motifs associated with the niddamycin PKS cluster and to 
group these motifs into modules (see FIG. 19) based on previous knowledge of Type I PKS 

15 organization. The C-6 position of the niddamycin macrolactone ring has an aldehyde derived 
from an ethyl side chain (FIG. 20). It was thus predicted that the AT of module 5 of the 
niddamycin cluster is responsible for incorporating this ethyl group into the growing chain. 
In addition, the carbon at C-7 of the molecule is completely saturated leading to the 
prediction that ER and DH motifs would also be present in module 5. These motifs were, in 

20 fact, found at the predicted region of the sequence. Furthermore, motifs for the preceding 4 
modules were as predicted, with an inactive ketoreductase motif in module 4 which leaves a 
keto group at C-9 of the ring. Sequencing of that KR showed that the nucleotide binding site 
GXGXXG (SEQ ID NO:27) was mutated to DXTXXP (SEQ ID NO:28). The nucleotide 
sequence (SEQ ID NO:29) and corresponding amino acid sequence (SEQ ID NO:33) of the 

25 ethyl AT of module 5 are shown in FIG. 21 (top and bottom strands respectively). 

A knockout experiment was also performed on this cluster, demonstrating that this 
sequence of DNA encodes the pathway for niddamycin biosynthesis. 

EXAMPLE 21 : Construction of nlasmid pEAT4 
30 A multistep strategy was used to construct the plasmid pUC/ethAT/C6 (FIG. 22), 

which consists of the DNA encoding the NidATS domain flanked by approximately 2.0 kb of 
sequence upstream and downstream from the eryAT4 encoding sequences, all contained in 
pUC19. EryAT4 flanking DNA was subcloned from pAIBX85. This plasmid is a pCS5 
derivative containing 8.4 kb of Sac, erythraea DNA from an^ol site to a BamUl site in the 
35 eryAII gene of the erythromycin PKS cluster. These sites correspond to bases 2321 1 and 

31581,respectively, of GenBank accession number M63676. The EryAT4 5 -flanking DNA 
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was isolated by digesting pAIBX85 with Mscl and BstEll (corresponding to nucleotides 
23,2 1 1 and 3 1 ,58 1 , respectively). The resulting 1 800 bp DN A fragment was treated with the 
Klenow Fragment of DNA Polymerase I, ligated into the Smal site of pUC19, and 
transformed into E. coli DH5a. Clones were selected on LB plates containing 1 50 |ig/mL 

5 ampicillin and 50 )j.L of a 2% solution of X-gal for blue/white selection. The clones were 
confirmed by restriction analysis, resulting in the intermediate vector pUC/5'-flank. For 
convenient cloning of the NidAT5-encoding sequences, an ^ vrll site was engineered at the 3' 
end of the 5* flanking DNA. This was accomplished by PGR amplification from the PmR site 
of the 5' flanking DNA to the BstEW site with two oligonucleotides (SEQ ID NO:21 and SEQ 

10 ID NO:22). SEQ ID NO:22 incorporates an ^vrll site and a BamWl site at the 3* end of the 5' 
flanking DNA. PGR conditions were as described in EXAMPLE 20 using Sac, erythraea 
DNA as template with the following changes: Taq polymerase (GIBCO BRL) was used with 
the accompanying lOx buffer instead of VentR® DNA polymerase and cycling conditions 
were 96^C/30 sec, 55°G/30 sec, 72°C/30 sec for 25 cycles. The resulting 300 bp PGR 

1 5 fragment was then digested with PmH and Bamlll, gel purified from a 1 .0 % agarose gel with 
Prep-A-Gene, and ligated back into pUG/5 -flank digested with Pmll and BarnUl to give 
pUG/5'-flank--<4vrIL The ligation was transformed into DH5a and plated onto LB plates 
containing 1 50 ^ig/mL ampicillin. Clones were confirmed by restriction analysis and DNA 
sequencing. 

20 In order to clone the NidAT5-encoding DNA fragment downstream of the 5* flanking 

DNA, an Avrll site was also engineered at the 5' end of the NidATS-encoding DNA. As 
depicted in FIG. 23, an^vrll site could be engineered into the NidATS DNA without altering 
the amino acid sequence. Two PGR oligonucleotides (SEQ ID NO:23 and SEQ ID NO:24) 
were designed to create an Avrll site at the 5' end and a BarriHi site at the 3* end, respectively, 

25 of the NidAT5-encoding DNA, A convenient Fsel site occurs naturally at the 3' end of 
NidAT5 -encoding sequence, so the resulting PGR fragment contains an Fsel site just 
upstream of the PGR engineered BamYll site. SEQ ID NO:23 and SEQ ID NO:24 were used 
in a PGR reaction with the template pi 6-2.2. This plasmid is pUG19 containing a 2.2 kb 
Smal fragment firom module 5 of the niddamycin PKS cluster (see FIG. 19), which 

30 encompasses the sequences encoding NidATS. The resulting 1 .0 kb PGR fi"agment was 

digested with Avrll and 5amHI, purified from a 1 .0 % agarose gel using Prep-A-Gene, and 
cloned into the AvrlllBamlVL sites of pUC/5'-flank-y4vrII. Clones were confirmed by 
restriction analysis and DNA sequencing, creating the intermediate plasmid pUC/S*- 
flank/ethAT. 

35 The EryAT4 3' -flanking DNA was subcloned by digesting pAIBX85 with Pmll and 

Mscl, corresponding to nucleotides 29,231 and 31,209, respectively, fi-om the eryAJI gene 
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(GenBank accession number M63676). The DNA was gel purified on a 1 .0 % agarose gel 
using Prep-A-Gene and ligated into the Smal site of pUC19, The ligation was transformed 
into DH5a and plated as described previously. Clones were confirmed by restriction 
analysis, resulting in the plasmid pUC/3 '-flank. 
5 Attachment of the EryAT4 3'-flanking DNA to the NidATS -encoding sequence was 

accomplished by digesting plasmid pUC/3'-flank with Fsel and BamHl, gel purifying the 
fragment from a 1 .0 % agarose gel using Prep-A-Gene, and ligating it into pUC/5'- 
flank/ethAT that had been previously digested with Fsel and BamHl. The ligation was 
transformed into DH5a as before and clones were analyzed by restriction analysis, resulting 
10 in the intermediate plasmid pUC/ethAT/C-6. The final step was to remove the 

NidAT5/flanking DNA insert from pUC/ethAT/C-6 with EcoRl and Hindlll and ligate it into 
the ^coRI///zndIII sites of pCS5, resulting in the gene replacement/integration plasmid 
pEAT4 (FIG. 24). 

15 EXAMPLE 22: Construction of Sac, ervthraea ER720 EAT4-46 

An example of a 6-desmethyl-6-ethylerythromycin A producing microorganism was 
prepared by replacing the DNA fragment encoding the methyimalonyl acyltransferase domain 
in module 4 of the erythromycin PKS (EryAT4) of Sac. erythraea ER720 with a newly 
discovered DNA fragment encoding an ethylmalonyl acyltransferase domain (NidATS) from 

20 S. caelestis NRRL-2821 . This was aiccomplished using the recombinant plasmid pEAT4, 
prepared as described in EXAMPLE 21. Transformation of Sac. erythraea ER720 and 
selection and confirmation of stable were carried out essentially as described in EXAMPLE 
4. Nine thiostrepton sensitive colonies were selected and their chromosomal DNA cut with 
Mlul and analyzed by Southern hybridization. In three of the nine thiostrepton sensitive 

25 colonies, a probe consisting of an approximately 900 bp fragment spanning a KS/AT domain 
in Streptomyces caelestis hybridized with a chromosomal firagment of approximately 1 .8 kb, 
indicating that NidATS had replaced EryAT4 in the chromosomes of these resolvants. The 
strain was named Sac, erythraea ER720 EAT4-46, referred to as simply EAT4-46. 

30 EXAMPLE 23: Analysis of compounds produced by EAT4-46 

Compounds produced by strain EAT4-46, whose construction is described in 
EXAMPLE 22, were characterized by TLC^ bioautography and mass spectrometry. 

The cells were grown in 30 mL of SCM for 4-5 days at 3(fC. The culture was 
processed for TLC essentially as described in EXAMPLE 5. The results showed that EAT4- 
35 46 produced a compound that migrated with the same rf as erythromycin A produced by wild 

type Sac, erythraea ER720, except in much lower yield. 
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To determine the molecular mass of the compound, an ethyl acetate extract was 
prepared from a 50 mL SCM culture of EAT4-46 as described above, using a proportionate 
amount of reagents. The resulting residue was taken up in 50 yiL of ethyl acetate and run on a 
TLC plate as described previously, except that the plate was not sprayed with anisaldehyde. 

5 The compound of interest was isolated by scraping the silica resin in the vicinity of the spot 
and extracting the resin as described in EXAMPLE 8. Mass spectrometric analysis revealed 
that the compound produced by the EAT4-46 strain had a mass of 734, which corresponds to 
the molecular ion plus a proton (M+H"^) of erythromycin A. 

In an attempt to increase substrate pools for the NidATS ethylmalonyl AT 

10 construction, the EAT4-46 strain was grown in 100 mL of SCM media containing 50 mM 
butyric acid, pH 7.0. The culture was grown for 4 days at SO'^C and then centrifuged for 10 
minutes in a Sorval GLC-4 Centrifuge to pellet the cells. The resulting supernatant was 
adjusted to pH 9.0 by the addition of 600 ^L of NH4OH and extracted twice with 1/2 

volumes of ethyl acetate as described previously. After drying in a Speed-Vac rotary 
15 concentrator, the extracted material was taken up in 100 p.1 of ethyl acetate and 10 |il was 

used for TLC analysis as described previously. Two spots running near eryA were observed 
in the butyric acid fed culture as opposed to only one spot in SCM media alone. To 
determine the molecular mass of the two spots, most of the remainder of the extract was again 
subjected to TLC, and the compounds in the eryA region of the plate were isolated as 
20 described previously. Mass spectrometric analysis revealed that the two spots had molecular 
masses of 734 and 748. A molecular mass of 734 corresponds to the molecular ion plus a 
proton (M-hH"*") of erythromycin A, whereas the species of molecular mass 748 is consistent 
with the molecular mass plus a proton (M+H"^) of ethylerythromycin A. 

25 EXAMPLE 24: Cloning of the NidAT6 Domain from 

Streptomvces caelestis NRRL-2821 
A genomic library of Streptomyces caelestis NRRL-2821 DNA was generated and 
screened with a probe specific for PKS genes as described in EXAMPLE 20. From Southern 
analysis of &/I digests of the positive clones, some clones were selected for further analysis. 

30 These clones were digested with Sma\ and mn on a 1% agarose gel for Southern 

hybridization with the PKS specific probe. The analysis revealed that a second cosmid, 
pCEL13f5, shared many hybridizing bands with pCEL18h5, but also contained two unique 
bands of 1 .9 kb and 6.0 kb. This cosmid was chosen for further analysis in order to determine 
the sequence of the remaining PKS genes in the niddamycin pathway. Cosmid pCEL13f5 

35 was digested with Sstl and the fragments were ligated to pUC 1 9. A large Sstl fragment (> 1 0 
kb) was frirther digested with Smal and ligated to pUCl 9. The ligations were transformed 
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into DH5a cells and clones were selected on LB plates containing 150 |Lig/mL ampicillin and 
50 |xl of a 2% solution of X-gal for blue/white selection. DNA from clones containing the 
appropriate insert was isolated using the QIAprep Spin Plasmid Kit (QIAGEN Inc., 
Chatsworth, CA). Subclones were sequenced using the ABI PRISM Dye Terminator Cycle 
Sequencing Ready Reaction Kit (Perkin Elmer), and the reactions were run on a 4.75% 
acrylamide, 8.3 M urea gel in an Applied Biosystems 373 DNA Sequencing System. 
Ordering of the inserts and motif identification was done as described in EXAMPLE 20. 

The msert in cosmid pCEL13f5 was found to be approximately 25 kb in length, and 
the 5' end of the insert had about 10 kb of identical sequence with the 3* end of the insert in 
pCEL18h5. Together, the two cosmids contain all of the PKS genes of the niddamycin 
pathway (FIG. 19). Based on the structure of niddamycin (FIG. 20), the AT contained in 
module 6 (NidAT6) may utilize hydroxymalonate (tartronate) in the biosynthesis of the C-3, 
C-4, and 0-4 positions of the macrolactone ring of niddamycin. (S. Omura et al (J. 
Antibiotics '^e-M 1-613 (1983)) have suggested that glycolate may be incorporated in the 
biosynthesis of the C-3, C-4 and 0-4 positions of leucomycin, a closely related 16-membered 
macrolide). The nucleotide sequence of NidAT6 (top strand, SEQ ID NO:30) and its 
corresponding amino acid sequence (lower strand, SEQ ID NO:34) are shown in FIG. 25. A 
comparison of the amino acid sequence of NidAT6 with other ATs in the Swissprot database 
shows that NidAT6 resembles methylmalonyl ATs. 

EXAMPLE 25: Construction of plasmid pUC18/NidAT6 
Two PCR oligonucleotides (SEQ ID NO:25 and SEQ ID NO:26) are designed to 
subclone the 1024 bp DNA fragment encoding the NidAT6 domain from the niddamycin 
PKS cluster and to introduce two unique restriction sites, ^vrll and Nsil^ for cassette cloning. 
This necessitates nucleotide changes, shown in bold in FIG. 26, at the beginning and near the 
end of the NidAT6-encoding DNA sequence. The changes shown also cause the replacement 
of a proline codon near the N-terminus of the NidAT6 domain with a valine codon, in order 
to increase the similarity of the domain jimction sequence to that found naturally for some of 
the AT domains of the rapamycin PKS. (In FIG. 26, the underlined nucleotides are the wild- 
type sequence.) In addition, two other restriction sites, EcoRl and 5g/II, are also introduced 
at the 5* ends of the N-terminal and C-terminal oligonucleotides, respectively, for convenient 
subcloning of the PCR-generated product. The approximately 1 kb NidAT6 domain 
encoding DNA is amplified using methods described in Reagents and General Methods from 
Cosmid pCEL13f5. The PCR product is digested with £coRI and BglVl and subcloned into 
the EcoRI and BamYil sites of pUC18. The ligation mixture is transformed into E. coll DH5a 
(GIBCO BRL) according to the manufacturer's instructions and transformants are selected on 
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LB plates containing 150 ^ig/mL ampicillin and 50 of a 2% solution of X-gal for 
blue/white selection. Clones are confirmed by restriction analysis and the fidelity of the 
insert is confirmed by DNA sequencing. The final plasmid construct is named 
pUC18/NidAT6. 

5 

EXAMPLE 26: Construction of plasmid pErvAT2/NidAT6 
pEryAT2/NidAT6 is constructed using standard methods of recombinant DNA 
technology according to the schematic outlines of FIGS. 26 and 27. To make a gene- 
replacement-vector specific for the eryAT2 domain, the two DNA regions immediately 

10 adjacent to eryAT2 are cloned and positioned adjacent to the DNA encoding the NidAT6 

domain in order to allow homologous recombination to occur. The strategy and protocol for 
constructing the intermediate plasmid containing the flanking regions, pCS5/AT2-flank, are 
described in EXAMPLE 6 and FIG. 1 1 . The final step in the construction of 
pEryAT2/NidAT6 is to ligate the 1 kb NidAT6-encoding DNA fragment having ^ vrll and 

1 5 Nsil ends to pCS5/AT2-flank (EXAMPLE 6) cut with the same enzymes to give the gene 
replacement/integration plasmid pEryAT2/NidAT6 (FIG. 27). All ligation mixes are 
transformed into the intermediate host E. coli DH5a and clones are selected and 
characterized as described previously. 

20 EXAMPLE 27: Construction of Sac, ervthraea ER720 ErvAT2/NidAT6 

A lO-desmethyl-lO-hydroxyerythromycin A and 12-deoxy-lO-desmethyMO- 
hydroxyerythromycin A producing microorganism is prepared by replacing the DNA 
fragment encoding the methylmalonyl acyltransferase domain of module 2 of the 
erythromycin PKS (EryAT2) of Sac, erythraea ER720 with a DNA fragment encoding a 

25 hydroxymalonyl acyltransferase domain (NidAT6) from S. caelestis NRRL-2821. This is 
accomplished with the recombinant plasmid, pEryAT2/NidAT6, prepared as described in 
EXAMPLE 26. Transformation of ER720 and selection and confirmation of stable 
resolvants zire carried out essentially as described in EXAMPLE 4. Thiostrepton sensitive 
colonies are then selected and these are confirmed by Southern hybridization, using 

30 conditions described above, to have the EryAT2 replaced by NidAT6. The strain is 
designated Sac. erythraea ER720 EryAT2/NidAT6. 

EXAMPLE 28: Analysis of compounds produced by 
Sac, ervthraea ER720 ErvAT2/NidAT6 
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Compounds produced by the recombinant Sac. erythraea strain, ER720 
EryAT2/NidAT6, whose construction is described in EXAMPLE 27, are characterized by 
TLC, bioassay, and mass spectrometry. 

For TLC analysis cells are grown in either SGGP or SCM medium for 4-5 days at 
3(fC. The culture is processed for TLC essentially as described in EXAMPLE 5. Two novel 
compounds predicted to be 10-desmethyl-lO-hydroxy erythromycin A and 12-deoxy-lO- 
desmethyl-lO-hydroxyerythromycin A, are expected to appear as blue spots running slightly 
slower than erythromycin A. 

To determine whether the novel spots seen on TLC have the molecular mass 
corresponding to the predicted 10-desmethyl-lO-hydroxyerythromycin A and 12-deoxy-lO- 
desmethyl-lO-hydroxyerythromycin A, the remaining extract is further analyzed by mass 
spectrometry. The two novel compounds are predicted to have masses of 736 and 720, which 
correspond to the molecular ion plus a proton (M+H"*") of 10-desmethyl-lO- 
hydroxyerythromycin A and 12-deoxy-lO-desmethyl-lO-hydroxyerythromycin A, 
respectively. 

EXAMPLE 29: Construction of olasmid pErvATs/NidAT6 
pEryATs/NidAT6 was constructed using standard methods of recombinant DNA 
technology according to the schematic outlines of FIGS. 28 and 29. To construct a gene- 
replacement vector specific for the eryATs domain, the two DNA regions immediately 
adjacent to eryATs-encoding DNA were cloned and positioned adjacent to the NidAT6- 
encoding DNA (EXAMPLE 25). The 5' and 3' boundaries of eryATs were designated as 
nucleotides 902 and 1908, and correspond to the deposited eryAI sequence (GenBank 
accession number M63676). To subclone the DNA fragment upstream of the eryATs domain 
encoding region from the Sac, erythraea chromosome, two PCR oligonucleotides (SEQ ID 
NO: 35 and SEQ ID NO: 36) were designed so that an EcoRl site was added at the 5* end of 
the region and AvrWBamlil restriction sites were introduced at the 3' end. The 5 -flanking 
region (about 1.2 kb) was generated by PCR using plasmid pAIEN22 DNA as template under 
conditions described in EXAMPLE 2. The PCR product was subcloned into EcoRl and 
Bamm sites of pUCl 8 and the ligated DNA transformed into E, coli DH5a (GIBCO BRL) 
according to the manufacturer's instructions. Clones were selected on LB plates containing 
150 |ig/mL ampicillin and 50 jiL of a 2% solution of X-gal for blue/white selection. Clones 
were confirmed by restriction analysis and the fidelity of the insert was confirmed by DNA 
sequencing. The resulting construct was named pUCl 8/ATs/5 -flank. 

For subcloning the 3*-flanking region of the eryATs from the Sac. erythraea 
chromosome, two PCR oligonucleotides (SEQ ID NO: 37 and SEQ ID NO: 38 were designed 
so that BarnHl-Nsil restriction sites were introduced into the 5' end of the region and a 
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Hindlll restriction site was added to the 3' end. The 3'-flanking region (about 1.2 kb) was 
also generated by PCR using pAIEN22 as template as described above. The PCR fragment 
was subcloned into the BamHl and HindlU sites of pUC18 and the ligated DNA transformed 
into E. coll DH5a as above. Clones were selected on LB plates containing 1 50 ng/mL 

5 ampicillin and 50 |iL of a 2% solution of X-gal for blue/white selection. Clones were 
confirmed by restriction analysis and the fidelity of the insert was confirmed by DNA 
sequencing. This intermediate construct was named pUC18/ATs/3'-flank (FIG. 28). The 1.2 
kb EcoKL/BamUl 5'-flanking fragment was isolated from pUC18/ATs/5'-flank and subcloned 
into the plasmid pCS5 cut with the same enzymes, generating pCS5/ATs/5*-flank. The 1 .2 kb 

10 BamUl and Hindlll 3'-flanking fragment was isolated from pUC18/ATs/3'-flank and then 

cloned into the pCS5/ATs/5*-flank vector cut with the same enzymes, resulting in pCS5/ATs- 
flank. The final step in the construction of pEryATs/NidAT6 was to ligate the 1 kb NidAT6 
fragment having Avrll and Nsil ends, isolated from pUC18/NidAT6 (EXAMPLE 25) to 
pCS5/ATs-flank cut with the same enzymes to give the gene replacement/integration plasmid 

15 pEryATs/NidAT6 (FIG. 29). All ligation mixtures were transformed into the intermediate 
host E. coli DH5a and clones selected as previously described. 

EXAMPLE 30: Construction of Sac, ervthraea HATS 

20 A 14-hydroxyerythromycin A producing microorganism was prepared by replacing 

the DNA jBragment encoding the acyltransferase domain at the first AT of the amino terminus 
of the erythromycin PKS (EryATs) of Sac, erythraea ER720 that directs the loading of the 
starter unit propionyl CoA, with DNA encoding an hydroxymalonyl acyltransferase domain 
from the sixth module of the niddamycin PKS of S. caelestis NRRL-2821 (NidAT6). This 

25 was accomplished with the recombinant plasmid, pEryATs/NidAT6, prepared as described in 
EXAMPLE 29. Transformation of ER720 

and selection and confirmation of stable resolvants were carried out essentially as described 
in EXAMPLE 4. DNA isolated fi-om thiostrepton-sensitive colonies were employed in 
Southern hybridizations, using stringency conditions described above, to identify clones that 
30 had the EryATs domain replaced by NidAT6. The strain carrying such a replacement was 
designated Sac. erythraea HATS. 

EXAMPLE 3 1 : Analysis of compounds produced by Sac, ervthraea HATS 

35 Sac. erythraea HATS was grown in 30 mL of DOM medium (1 5.0 g soluble starch, 

22.0 g soy flour, 2.0 g CaCOs, L5 g brewers yeast, LO g MgS04-7H20, FeS04-7H20, 50 
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mL soybean oil per liter of distilled H20) supplemented with 0.5% glycine for 7 days at 30^C 
and then centrifiiged for 10 minutes in a clinical centrifuge to remove cells. The supernatant 
was removed to another tube and the pH adjusted to 9.0 by the addition of NH4OH. The 
supernatant was extracted twice with 15.0 mL of dichloromethane and the lower organic 
5 phases were pooled and dried down. The residue was partitioned in a 10:10:1 mixture of 
heptane:methanol:0.05M KH2PO4 and the lower methanol/0.05M KH2PO4 layer was 
collected. A fresh 10:1 methanol/0.05M KH2PO4 mixture was added to the heptane phase 
and partitioned again. The methanol/0.05M KH2PO4 phases were pooled and dried down to 
remove the methanol, and the remaining aqueous phase was adjusted to pH 8 with 0.05M 

10 KH2PO4, pH 8. This was then extracted twice with 1/2 volume of dichloromethane, the 
lower organic phases pooled, and then dried down. The residue was taken up in 30 of 
ethyl acetate and TLC performed essentially as described in EXAMPLE 5. A compound 
predicted to be erythromycin A was detected as a blue spot, along with a compound predicted 
to be 14-hydroxy erythromycin A, which also appeared as a blue spot but running slightly 

1 5 below the erythromycin A spot. 

To determine the mass of the compounds produced by Sac. erythraea HATS, a 16 fiL 
sample from above of the extract was analyzed by mass spectrometry. The analysis identified 
a compound with mass 734, corresponding to the molecular ion plus a proton (M+H*)of 
erythromycin A, as well as a compound with mass 736, which is consistent with the 

20 molecular ion plus a proton (M+H*) of 14-hydroxy erythromycin A. 

EXAMPLE 32: Construction of plasmid pErvMl/NidAT6 

25 pEryMl/NidAT6 was constructed using standard methods of recombinant DNA 

technology according to the schematic outlines of FIGS. 9 and 30. To construct a gene- 
replacement vector specific for the eryATl domain, the two DNA regions immediately 
adjacent to eryATl encoding DNA were cloned and positioned adjacent to the DNA 
encoding the NidAT6 domain in order to allow homologous recombination to occur. The 

30 strategy and protocol for construction of the intermediate plasmid containing the eryATl 

flanking regions, pCS5/ATl -flank, are described in EXAMPLE 2 and FIG. 9 . The final step 
in the construction of pEryMl/nidAT6 was to first digest pUC18/NidAT6 (EXAMPLE 25) 
with Avrll and Nsil, and then ligate the 1 kb NidAT6 fragment generated into pCS5/ATl- 
flank cut wdth the same enzymes to give the gene-replacement/integration plasmid 

35 pEryMl/NidAT6 (FIG. 30). All ligation mixes were transformed into the intermediate host 
E. coll DH5a and clones were selected and characterized as described previously. 
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EXAMPLE 33: Construction of Sac, ervthraea HATl 

A 6-deoxy-12-desmethyM2-epierythromycin A- and 12-desmethyl-12- 
5 epierythromycin A-producing microorganism was prepared by replacing the DNA fragment 
encoding the methylmalonyl acyltransferase domain of module 1 of the erythromycin PKS 
(EryATl) oiSac. erythraea ER720 with a DNA fragment encoding the hydroxymalonyl 
acyltransferase domain (NidAT6) from S, caelestis NRRL-2821. This was accomplished 
with the recombinant plasmid, pEryMl/NidAT6, prepared as described in EXAMPLE 32. 
10 Transformation of Sac. erythraea ER720 and selection and confirmation of stable resolvants 
were carried out essentially as described in EXAMPLE 4. DNA isolated from thiostrepton- 
sensitive colonies were employed in Southern hybridizations, using stringency conditions 
described above, to identify clones that had the EryATs domain replaced by NidAT6.The 
strain carrying such a replacement was designated Sac, erythraea HATl , 

15 

EXAMPLE 34: Analysis of compounds produced bv 
Sac, ervthraea HATl 

Different compounds were produced by Sac, erythraea HATl (EXAMPLE 33) 
20 depending upon the medium employed for growth. In one example, the culture was grown 

for 5 days at 30°C in 50 mL SCM medium (20 g Soytone, 15 g Soluble Starch, 10.5 g MOPS, 
1.5 g Yeast Extract and 0.1 g CaCl^per liter of distilled H^O) supplemented with 10 mM 
glycerol. The culture was then processed for TLC analysis essentially as described in 
EXAMPLE 5. Compounds appearing as blue spots mnning in the region of erythromycin A 

25 were detected. Mass spectrometry analysis of a 16 |iL sample of the extract identified a 
compound with mass 734, corresponding to the molecular ion plus a proton (M+H^) of 
erythromycin A, as well as a compound with mass 704, which is consistent with the 
molecular ion plus a proton (M+H^) of 6-deoxy-12-desmethyl-12-epierythromycin A. 

In a second experiment. Sac. erythraea HATSl was grown for 4 days at 30°C in 50 

30 mL of the following medium: 1 5 g com starch, 20 g soy flour, 1 .5 g dried brewer's yeast, 1 0 
g soybean oil, 1 g CaC03, 0.5 g MgSO4.7H20, 0.015 g FeS04, and 1 g sodium pyruvate per 
liter of distilled water. After growth, the culture was processed for TLC as described above. 
Compounds appearing as blue spots running in the region of erythromycin A were detected. 
Meiss spectrometry analysis of a small sample of the extract identified a compound with m£iss 

35 720, which is consistent with the molecular ion plus a proton (M+H^ of 12-desmethyl-12- 
epierythromycin A. 
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EXAMPLE 35: Cloning of rap liease-PKS containing fragments from 
Streptomvces hysroscovicus ATCC 29253 

5 A 0.8 kb fragment encoding a segment of the rapP gene (Schwecke et al , Proc. Natl, 

Acad. Sci. 92, 7839-7843 [1995]) was amplified by PGR from genomic DNA prepared from 
Streptomyces hygroscopicus ATCC 29253 using PGR conditions described in EXAMPLE 1 
and primers SEQ ID NO: 39 and SEQ ID NO: 40. The resulting 0.8 kb DNA fragment was 

labeled with 

32p 

using the Megaprime DNA labeling system (Amersham Life Science, 
10 Arlington Heights, IL) and used as a probe to isolate cosmids containing rapP and adjacent 
DNA in the following manner: a library of Streptomyces hygroscopicus ATCC 29253 
genomic DNA (EXAMPLE 1) was transferred to LB agar plates to yield approximately 3,000 
colonies/plate. After overnight growth the colonies were transferred from the plates to 
Hybond-N nylon membranes (Amersham Life Science, Arlington Heights, IL), and probed 
1 5 with the labeled rapP DNA segment following procedures outlined in Maniatis, et al. supra. 

Hybridization w£is performed at 65 ^C and a stringency wash was carried out with 0. Ix SSC 
at 65^0. Eleven positive cosmid clones were picked for restriction and PGR analysis. Five 
cosmids were identified to contain both the rapP gene as well as adjacent DNA containing 
segments of the rapA gene encoding a portion of the rapamycin PKS. The 5' end of the rapA 
20 gene encodes the function referred to hereinafter as " rapligase" which is required for the 

initiation of rapamycin biosynthesis. One of the cosmids, #2, was chosen as the DNA source 
for PGR and subcloning to construct DNA containing rapligase and downstream PKS 
domains for gene replacement. 

25 EXAMPLE 36: Construction of plasmid pSLl 180/rapligase 3.0 

Rapamycin biosynthesis is initiated by rapligase which employs the 
dihydrdxycyclohexyl-moiety of rapzimycin as the starter. Adjacent to the rapligase domain is 
the domain named ERS which is proposed to contain enoylreductase activity. A 3.0 kb 

30 segment of rapA encoding the rapligase and ERS was inserted into plasmid pSLl 1 80_ 

(Brosius, J., DNA 8, 759, 1989) using standard methods of recombinant DNA technology, as 
outlined in FIG. 3 1 , to yield the plasmid pSLl 1 80/rapligase 3.0. Since the rapligase-ERS- 
containing segment was to be used to replace the starting segment of the erythromycin PKS, 
thereby requiring the placement of the two unique restriction sites, Avrll and Nsil^ at the 5'- 

35 and 3 '-end of the 3.0 kb fragment, respectively, to facilitate subsequent cassette cloning, it 
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was necessary to assemble the 3.0 kb fragment containing rapligase and rapERS through a 
series of cloning steps shown in FIG. 31 . 

(a) Construction of dSLI 1 80/0.1 1 

5 

PGR primers SEQ ID NO: 41 and SEQ ID NO: 42 were used to amplify a 0.1 1 kb N- 
terminal fragment of the rapligase domain using cosmid #2 (EXAMPLE 35) as the DNA 
template. Primer SEQ ID NO: 42 was designed to contain an ^vrll site upstream of the 
sequence encoding the rapligase domain (FIG. 31). PGR reactions, cycling conditions and 
10 isolation of the desired fragment were as described in EXAMPLE 1 . The PGR product was 
then cloned into EcoRI/Sphl sites of pSLl 180, generating pSLl 180/0. 11 (FIG. 31), and the 
sequence fidelity was confirmed by nucleotide sequencing. 

(b) Gonstruction of pSLI 180/0.77 

15 

PGR primers SEQ ID NO: 43 and SEQ ID NO: 44 were used to amplify a 0.77 kb C- 
terminal fi-agment of the rapERS domain using cosmid #2 (EXAMPLE 35) as DNA template 
employing the conditions described immediately above. Primer SEQ ID NO:44 was 
designed to have a Nsil site immediately downstream from the ERS domain. The PGR 
20 product was then cloned into A7ioI///mdIII sites of pSLl 1 80, generating pSLl 1 80/0.77 (FIG. 
31). The sequence fidelity was confirmed by nucleotide sequencing. 

(c) Gonstruction of dSLI 180/0.1 1/2.1 

25 A 2.1 kb SphVXhol restriction fragment was isolated from cosmid #2 (EXAMPLE 35) 

and subcloned into the SphVXhol sites of the pSLl 180/0.1 1, generating pSLl 180/0.1 1/2.1 
(FIG. 31), 

(d) Gonstmction of pSLl 180/rapligase 3.0. 

30 

A 0.77 VbXhollHindlll fragment was isolated from pSLl 180/0.77 and subcloned into 
the J^oI/Z/mdlll sites of the pSLl 180/0.1 1/2.1, generating the plasmid pSLl 1 80/rapligase 
3.0 (FIG. 31). 

35 EXAMPLE 37: Gonstruction of plasmid pErvATs/rapligase 3.0 
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Plasmid pEryATs/rapligase 3.0 was constructed using standard methods of 
recombinant DNA technology according to the schematic outlines of FIG. 32. The plasmid 
cassette which contains eryATs flanking regions, pCS5/ATs-flank, was constructed as 
described in EXAMPLE 29. As outlined in FIG. 32, a 3.0 kb Avrll/Nsil fragment isolated 
5 from plasmid pSLl 1 80/rapligase 3.0 (EXAMPLE 35), was subcloned into the same sites of 
pCS5/ATs-flank, generating gene replacement/integration plasmid pEryATs/rapligase 3.0 
(FIG. 32). 

EXAMPLE 38: Construction of Sac, ervthraea ErvATs/rapligase 3.0 
10 An example of al3-desethyl-13-(3',4'-dihydroxycyclohexyl)methylerythromycin A 

producing microorganism was prepared by replacing the DNA fragment encoding the 
acyltransferase domain at the amino terminus of the erythromycin PKS (EryATs) of Sac. 
erythraea ER720 with the DNA fragment encoding rapligase-rapERS domains (EXAMPLE 
35) from S. hygroscopicus ATCC 29253. This was accomplished with the recombinant 
15 plasmid, pEryATs/rapligase 3.0, prepared as described in EXAMPLE 37. Transformation of 
Sac. erythraea ER720 and selection and confirmation of stable resolvants were carried out as 
described in EXAMPLE 4. Four thiostrepton sensitive colonies were selected and one of 
them was confirmed by Southern hybridization to have the EryATs replaced by rapligase- 
rapERS. The strain was named Sac. erythraea EryATs/rapligase 3,0. 

20 

EXAMPLE 39: Analysis of compounds produced bv Sac, ervthraea 

ErvATs/rapligase 3.0 

Sac. erythraea EryATs/rapligase 3.0, whose construction is described in EXAMPLE 
25 38, was grown in 50 mL of SCM medium (EXAMPLE 34) for 2 days at 30^0, then 
supplemented each day with ImM of 3,4-dihydroxycyclohexylcarboxylic acid for an 
additional 3 days. The culture then was processed for TLC analysis essentially as described 
in EXAMPLE 5. A novel compound predicted to be 13-desethyM3-(3%4'- 
dihydroxycyclohexyl)methylerythromycin A, appeared as a blue spot running slightly slower 
30 than erythromycin A. 

To detect biological activity, a TLC-bioautography assay was performed essentially as 
described in EXAMPLE 5 but using 20 \xh of the extracted sample. As with the positive 
controls, a small zone of inhibition developed around the sample spot indicating that the 
novel compound had bioactivity. 
35 To determine whether the novel spot seen on TLC had the molecular mass 

corresponding to the predicted 13-desethyl-13-(3%4'-dihydroxycyclohexyl)methyl- 
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erythromycin A, a small sample of the ethylacetate extract was further analyzed by mass 
spectrometry. The mass spectrometric analysis revealed the novel compound to have a mass 

of 820, which corresponds to the predicted molecular ion plus a proton (M+H"*") of 1 3- 
desethyl- 1 3-(3 ' ,4'-dihydroxycyclohexyl)methy lerythromycin A. 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 



(D) 


TOPOLOGY : 


linear 










(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:l 








GGGCCGCTGG 


CGGTGATGTT 


CACCGGACAG 


GGCTCCCAAC 


GCCCCGGCAT 


GGGACGACAG 


60 


TTGTACGAGC 


ACTTCCCCGT 


CTTCGCCCAG 


GCACTGGACG 


AGGTCTTCGC 


ACTCGCCACC 


120 


CCCGGACTAC 


GCGAGGTGAT 


GTTCGACCCC 


GACCAGGCCG 


AAACACTCCA 


ACGCACCGAC 


180 


CACGCCCAGA 


TCGCCCTGTT 


CGCCTTCGAA 


ACCGCCCTCT 


ACCGACTCTG 


GGAATCCTGG 


240 


GGCCTGCGAC 


CCGACATGGT 


CTGCGGACAC 


TCGGTCGGAG 


AAATCACCGC 


AGCCCACGTC 


300 


TCCGGCACCC 


TCACCCTCCC 


CGACGCCGTC 


CACCTCGTCA 


CCACACGCGG 


CACCCTCATG 


360 


CAAAACCTGC 


CCCCCGGCGG 


CGCCATGCTC 


GCCGTCGCCA 


CCGACCCCCA 


CACCCTCCAA 


420 


CCCCACCTCG 


ACAACCACCA 


CGACACCATC 


TCCATCGCCG 


CCATCAACGG 


CCCCCACGCC 


480 


ACCGTCCTCT 


CCGGCGACCG 


CACCACCCTC 


CACCACATCG 


CCACCCAACT 


CAACACCAAA 


540 


CCCTTCACCA 


CCACCCTCAA 


CACCCTCACC 


CACCACCCCC 


CACACACACC 


CCTCATCAGC 


600 


ATGCTCACCG 


CCACACCCAC 


CCACCCCGAC 


ACCACCCACT 


GGACCCAGCA 


CATCACCGCA 


660 


CCCGTCCGCT 


ACACCGACAC 


CCTCCACCAC 


CTCCACCACC 


ACGGCATCAC 


CACCTACCTC 


720 


GAAATCGGCC 


CCGACACCAC 


CCTCACCGCC 


CTCGCCCGCA 


CCACCCTCCC 


CACCACCACC 


780 


CACCTCATCC 


CCACCACCCG 


CCGCAACCAC 


AACGAAGTCC 


GCAGCACGAA 


CGAGGCGTTG 


840 


GGCAGGGTGT 


TCAGCGTGGG 


CCACTCGGTG 


GACTGGCGGG 


CCCTCACTCC 


GACCGGGAGG 


900 


CGTACCTCCC 


TGCCGACGTA 


CCCCT 








925 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 0 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doiibl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCTAGGACGG CAGTCCTGCT CACCGGGCAG GGTTCCCAGC GTCAGGGCAT GGGGCGCGAA 60 

CTGTACGACC GGTCACCGGT GTTCGCCGCC TCGTTCGACG CGATCTGCGC TCAACTCGAC 12 0 

GGGCAACTGC CTCGTCCCCT CAAGGACGTT CTCTTCGCCC CCGAGGGGTC GGAGGACGCC 180 

GCGCTCATCG ACCGTACGGT GTTCACACAG GCGGCTCTGT TCGCCGTGGA GACCTCCCTG 240 

TTCCGGCTGT TCGAGGCCCA CGGCCTCGTC CCCGACTACC TCATCGGCCA CTCCATCGGC 3 00 

GAAGTGACCG CGGCCCACCT GGCCGGGGTC CTCGATCTGG CGGACGCGTG CGTCCTGGTC 36 0 

GCCCACCGCG GCCGCCTGAT GCAGTCGGCC CGGGCCGGCG GCGCGATGGC CGCGGTCCAG 42 0 

GCGAGCGAGG ACGAGGTACG CGAGGCCCTC GCGACCTTCG ACGATGCGGT TGCCGTGGCC 480 

GGAGTCAACG GCCCGAACGC CACCGTCGTC TCCGGCGACG AGGACGCGGT CGAGCGGCTG 540 

GTCGCGCGCT GGCGCGAGCA GGGCAGGCGG ACGAAGCGGC TGCCGGTCAG CCACGCCTTC 600 

CACTCGCCGC ACATGGACGG GATCGTCGAC GAGTTCGTCA CCGCCGTCTC CGGGCTCACC 660 

TTCCGCTCCC CGACGATCCC GGTCGTCTCC AACGTCACCG GGACCCTCGC CACCGTCGAC 720 

CAGCTGACCT CGCCCGCGTA CTGGGCACGC CACATCCGCG AGGCCGTGCG CTTCGCCGAC 78 0 

GGGGTGCGGT ACCTGGAGGG CGAGGGCGTC ACCGAATGGC TGGAGCTCGG 6CCCGACGGC 840 
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GTTCTCGTCG CCCTGGTCGA GGACTGCCTG GCGAAGGAGG CGGGATCGCT CGCGTCCGCC 
CTGCGCAAGG GGGCGAGCGA GCCCCACACC GTGGGCGCGG CCATGGCCCG CGCGGTGCTG 
CGCGGATCCG GCCCCGACTG GGCGGCGGTG TTCCCCGGCG CACGGCGGGT CGACCTTCCG 
ACGTATGCAT 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATCTACACST CSGGCACSAC SGGCAAGCCS JVAGGG 35 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CTSAAGGCSG GCGGCGCSTA CGTSCCSATC GACCC 35 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: .3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGCGAATTCC TAGGCTGGCG GTGATGTTCA 30 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



900 
960 
1020 
1030 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



GCCGGATCCA TGCATACGTC GGCAGGGAGG TAC 



33 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCTCGAATTC GCTGGTCGCG GTGCACCT 28 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GACGGATCCG GCCCTAGGCT GCGCCCGGCT CG 32 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHJ\RACTERISTICS : 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTGGGATCCT ATGCATTCCA GCGCGAGCGC 30 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRT^EDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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GAGAAGCTTG GCGCGACTTG CCCGCT 26 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTTTTTAAGC TTGGTACCTG CTCACCGGCA ACACCG 36 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTTTTTGGAT CCCTGCAGCC TAGGGTCGGA GGCACTGCCG GT 42 
(2) INFORMATION FOR SEQ ID NO: 13: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTTTTTCTGC AGTATGCATT CCAGGGCAAG CiSGTTCT 37 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTTTTTGAAT TCACGCGTTG CCCGCGGCGT AGGCGC 36 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single . 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GATCGAATTC CCTAGGACGG CAGTCCTGCT CACC 34 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GATCGGATCC ATGCATACGT CGGAAGGTCG ACCCG 35 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTCGAAGAAT TCCCTAGGGT TGCCTTCCTG TTCGAC 36 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTCGAAAAGC TTATGCATAG ACCGGCAGAT CCACCG 36 
(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGGTSAAGTC SAACATCGG 19 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCRATCTCRC CCTGCGARTG 20 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GAGAGAGGAA CCAACGCGCA CGTGATCGTC GAAGAGGCAC CAGC 44 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAGAGAGGAT CCGACCTAGG CGCGGAGGTC ACCGGCGCGA CGGCG 45 
(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GAGAGACCTA GGAAGCCGGT GTTCGTGTTC CCCGGCCAGG GOT 43 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GAGAGAGGAT CCGAGGCCGG CCGTGCGCCC GGACCGAAGA CCGCCTC 47 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GAGAGAATTC CCTAGGGTCG CCTTCGTCTT TCCCGGGCAG G 41 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TTGAGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 37 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TTGAGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 37 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
TTGAGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 37 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GCCGACCGTG TCGTGTTCGT GTTCCCCGGC CAGGGCTCGC AGTGGGCCGG AATGGCCGAG 60 

GGGCTGCTGG AGCGGTCCGG CGCGTTCCGG AGTGCGGCCG ACTCGTGCGA CGCCGCGCTG 120 

CGGCCGTACC TCGGCTGGTC GGTGCTGAGC GTGCTGCGCG GGGAACCGGA CGCGCCCTCG 18 0 

CTCGACCGGG TCGACGTCGT GCAGCCGGTG CTGTTCACGA TGATGGTCTC GCTCGCGGCG 240 

GTCTGGCGTG CGCTGGGGGT GGAACCGGCG GCGGTCGTCG GGCACTCGCA GGGTGAGATC 300 

GCCGCTGCCC ATGTCGCCGG TGCGCTGTCG CTGGACGACT CGGCCCGGAT CGTCGCCCTG 360 

CGC:AGTCGGG CGTGGCTCGG ACTGGCGGGC AAGGGCGGCA TGGTGGCGGT GCCGATGCCG 420 

GCGGAGGAGC TGCGGCCGCG GCTGGTGACG TGGGGGGACC GTCTGGCCGT CGCCGCCGTC 480 

AACAGCCCCG GTTCCTGCGC CGTCGCAGGC GACCCGGAGG CGCTGGCCGA ACTGGTGGCG 540 

CTGCTGACCG GTGAGGGGGT GCACGCCCGG CCGATCCCCG GCGTCGACAC GGCGGGCCAC 600 

TCGCCGCAGG TGGACGCGTT GCGGGCTCAT CTGCTGGAGG TGCTGGCCCC GGTCGCCCCC 660 

CGACCGGCCG ACATCCCGTT CTACTCGACG GTGACCGGCG GGCTGCTGGA CGGCACCGAG 720 

CTGGACGCGA CGTACTGGTA CCGCAACATG CGCGAGCCCG TCGAGTTCGA GCGGGCCACA 780 

CGGGCGCTGA TCGCCGACGG GCACGACGTC TTCCTGGAGA CGAGCCCGCA TCCCATGCTG 84 0 

GCCGTGGCGC TGGAGCAGAC GGTCACCGAC GCCGGCACCG ACGCGGCGGT GCTCGGGACC 900 

CTGCGCCGCC GCCACGGCGG TCCTCGCGCG CTGGCCCTGG CCGTCTGCCG CGCCTTCGCG 960 

AGGCGGTCTT CGGTCCGGGC GCACGGCCCG TGGAGTTGCC CACCTATCCG 1010 

(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CGCGCGCCTG CCTTCGTCTT TCCCGGGCAG GGCGCCCAGT GGGCCGGACT GGGAGCGCGG 60 

CTCCTCGCGG ACTCCCCCGT CTTCCGCGCC AGGGCCGAGG CATGCGCGCG GGCGCTGGAG 120 

CCTCACCTCG ACTGGTCGGT CCTCGACGTG CTGGCCGGCG CCCCGGGCAC CCCTCCCATC 180 

GACCGGGCCG ACGTGGTGCA GCCGGTGCTG TTCACCACGA TGGTCTCGCT GGCCGCCCTC 24 0 

TGGGAGGCCC ACGGGGTGCG GCCGGCCGCG GTCGTGGGCC ACTCCCAGGG CGAGGTGGCC 3 00 

GCGGCCTGCG TGGCCGGTGC CCTGTCGCTG GACGACGCTG CCCTGGTGAT CGCCGGACGC 36 0 

AGCAGGCTGT GGGGGCGGCT GGCCGGGAAC GGCGGGATGC TCGCGGTGAT GGCTCCGGCC 420 

GAGCGGATCC GTGAGCTGCT CGAACCATGG CGGCAGCGGA TTTCGGTGGC GGCGGTCAAT 480 

GGCCCCGCCT CGGTCACCGT CTCCGGTGAC GCGCTCGCGC TGGAGGAGTT CGGCGCGCGG 54 0 

CTCTCCGCCG AGGGGGTGCT GCGCTGGCCG CTGCCGGGCG TCGACTTCGC CGGCCACTCG 600 

CCGCAGGTGG AGGAGTTCCG CGCTGAGCTC CTGGACCTGC TCTCCGGCGT ACGGCCGGCT 660 

CCTTCGCGGA TACCTTTCTT CTCCACCGTG ACGGCGGGTC CTTGCGGCGG CGACCAGCTG 72 0 

GACGGGGCGT ACTGGTACCG CAACACGCGC GAACCCGTGG AGTTCGACGC CACGGTCCGG 780 

GCGCTGCTGC GTGCGGGCCA TCACACGTTC ATCGAGGTCG GTCCGCATCC GCTGCTCAAC 840 

GCCGCGATCG ACGAGATCGC AGCGGACGAG GGGGTAGCGG CCACGGCCCT GCATACGCTC 900 

CAGCGGGGCG CTGGCGGCCT TGACCGCGTG CGCAACGCGG TGGGCGCCGC TTTCGCGCAC 96 0 

GGTGTCCGGG TCGACTGGAA CGCCCTGTTC GAGGGCACCG GTGCGCGCAG GGTGCCGCTT 1020 

CCCTCGTACG CCTTC 1035 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gly Pro Leu Ala Val Met Phe Thr Gly Gin Gly Ser Gin Arg Pro Gly 

15 10 15 

Met Gly Arg Gin Leu Tyr Glu His Phe Pro Val Phe Ala Gin Ala Leu 

20 25 30 

Asp Glu Val Phe Ala Leu Ala Thr Pro Gly Leu Arg Glu Val Met Phe 

35 40 45 

Asp Pro Asp Gin Ala Glu Thr Leu Gin Arg Thr Asp His Ala Gin lie 

50 55 60 

Ala Leu Phe Ala Phe Glu Thr Ala Leu Tyr Arg Leu Trp Glu Ser Trp 
65 70 75 80 

Gly Leu Arg Pro Asp Met Val Cys Gly His Ser Val Gly Glu lie Thr 

85 90 95 

Ala Ala His Val Ser Gly Thr Leu Thr Leu Pro Asp Ala Val His Leu 
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100 

Val Thr Thr Arg 
115 

Met Leu Ala Val 
130 

Asn His His Asp 
145 

Thr Val Leu Ser 

Leu Asn Thr Lys 
180 

Pro Leu Met Gin 
195 

Leu Thr His His 

210 

Thr Pro Thr His 
225 

Pro Val Arg Tyr 

Thr Thr Tyr Leu 
260 

Arg Thr Thr Leu 
275 

Asn His Asn Glu 
290 

Ser Val Gly His 
305 

Arg Thr Ser Leu 



Gly Thr Leu Met 
120 

Ala Thr Asp Pro 
135 

Thr lie Ser lie 

150 

Gly Asp Arg Thr 
165 

Thr Asn Trp Leu 

Pro lie Leu Gin 
200 

Pro Pro His Thr 

215 

Pro Asp Thr Thr 
230 

Thr Asp Thr Leu 
245 

Glu lie Gly Pro 

Pro Thr Thr Thr 

280 

Val Arg Ser Thr 
295 

Ser Val Asp Trp 
310 

Pro Thr Tyr Pro 
325 



105 

Gin Asn Leu Pro 

His Thr Leu Gin 
140 

Ala Ala lie Asn 
155 

Thr Leu His His 
170 

Asn Val Ser His 
185 

Pro Phe Thr Thr 

Pro Leu lie Ser 
220 

His Trp Thr Gin 
235 

His His Leu His 
250 

Asp Thr Thr Leu 
265 

His Leu lie Pro 

Asn Glu Ala Leu 
300 

Arg Ala Leu Thr 
315 



110 

Pro Gly Gly Ala 
125 

Pro His Leu Asp 

Gly Pro His Ala 
160 

He Ala Thr Gin 

175 

Ala Phe His Ser 
190 

Thr Leu Asn Thr 
205 

Met Leu Thr Ala 

His He Thr Ala 
240 

His His Gly He 
255 

Thr Ala Leu Ala 
270 

Thr Thr Arg Arg 

285 

Gly Arg Val Phe 

Pro Thr Gly Arg 
320 



(2) INFORMATION FOR SEQ ID NO: 32: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Pro Arg Thr Ala Val Leu Leu Thr Gly Gin Gly Ser Gin Arg Gin Gly 

15 10 15 

Met Gly Arg Glu Leu Tyr Asp Arg Ser Pro Val Phe Ala Ala Ser Phe 

20 25 30 

Asp Ala He Cys Ala Gin Leu Asp Gly Gin Leu Pro Arg Pro Leu Lys 

35 40 45 

Asp Val Leu Phe Ala Pro Glu Gly Ser Glu Asp Ala Ala Leu He Asp 

50 55 60 

Arg Thr Val Phe Thr Gin Ala Ala Leu Phe Ala Val Glu Thr Ser Leu 
65 70 75 80 

Phe Arg Leu Phe Glu Ala His Gly Leu Val Pro Asp Tyr Leu He Gly 
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85 90 95 

His Ser lie Gly Glu Val Thr Ala Ala His Leu Ala Gly Val Leu Asp 

100 105 110 

Leu Ala Asp Ala Cys Val Leu Val Ala His Arg Gly Arg Leu Met Gin 

115 120 125 

Ser Ala Arg Ala Gly Gly Ala Met Ala Ala Val Gin Ala Ser Glu Asp 

130 135 140 

Glu Val Arg Glu Ala Leu Ala Thr Phe Asp Asp Ala Val Ala Val Ala 
145 150 155 160 

Gly Val Asn Gly Pro Asn Ala Thr Val Val Ser Gly Asp Glu Asp Ala 

165 170 175 

Val Glu Arg Leu Val Ala Arg Trp Arg Glu Gin Gly Arg Arg Thr Lys 

180 185 190 

Arg Leu Pro Val Ser His Ala Phe His Ser Pro His Met Asp Gly lie 

195 200 205 

Val Asp Glu Phe Val Thr Ala Val Ser Gly Leu Thr Phe Arg Ser Pro 

210 215 220 

Thr lie Pro Val Val Ser Asn Val Thr Gly Thr Leu Ala Thr Val Asp 
225 230 235 240 

Gin Leu Thr Ser Pro Ala Tyr Trp Ala Arg His lie Arg Glu Ala Val 

245 250 -255 

Arg Phe Ala Asp Gly Val Arg Tyr Leu Glu Gly Glu Gly Val Thr Glu 

260 265 270 

Trp Leu Glu Leu Gly Pro Asp Gly Val Leu Val Ala Leu Val Glu Asp 

275 280 285 

Cys Leu Ala Lys Glu Ala Gly Ser Leu Ala Ser Ala Leu Arg Lys Gly 

290 295 300 

Ala Ser Glu Pro His Thr Val Gly Ala Ala Met Ala Arg Ala Val Leu 
305 310 315 320 

Arg Gly Ser Gly Pro Asp Trp Ala Ala Val Phe Pro Gly Ala Arg Arg 

325 330 335 

Val Asp Leu Pro Thr Tyr Ala 
340 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Ala Asp Arg Val Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Ala 

15 10 15 

Gly Met Ala Glu Gly Leu Leu Glu Arg Ser Gly Ala Phe Arg Ser Ala 

20 25 30 

Ala Asp Ser Cys Asp Ala Ala Leu Arg Pro Tyr Leu Gly Trp Ser Val 

35 40 45 

Leu Ser Val Leu Arg Gly Glu Pro Asp Ala Pro Ser Leu Asp Arg Val 
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50 55 60 

Asp Val Val Gin Pro Val Leu Phe Thr Met Met Val Ser Leu Ala Ala 
65 70 75 80 

Val Trp Arg Ala Leu Gly Val Glu Pro Ala Ala Val Val Gly His Ser 

85 90 95 

Gin Gly Glu lie Ala Ala Ala His Val Ala Gly Ala Leu Ser Leu Asp 

100 105 110 

Asp Ser Ala Arg He Val Ala Leu Arg Ser Arg Ala Trp Leu Gly Leu 

115 120 125 

Ala Gly Lys Gly Gly Met Val Ala Val Pro Met Pro Ala Glu Glu Leu 

130 135 140 

Arg Pro Arg Leu Val Thr Trp Gly Asp Arg Leu Ala Val Ala Ala Val 
145 150 155 160 

Asn Ser Pro Gly Ser Cys Ala Val Ala Gly Asp Pro Glu Ala Leu Ala 

165 170 175 

Glu Leu Val Ala Leu Leu Thr Gly Glu Gly Val His Ala Arg Pro He 

180 185 190 

Pro Gly Val Asp Thr Ala Gly His Ser Pro Gin Val Asp Ala Leu Arg 

195 200 205 

Ala His Leu Leu Glu Val Leu Ala Pro Val Ala Pro Arg Pro Ala Asp 

210 215 220 

He Pro Phe Tyr Ser Thr Val Thr Gly Gly Leu Leu Asp Gly Thr Glu 
225 230 235 240 

Leu Asp Ala Thr Tyr Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe 

245 250 255 

Glu Arg Ala Thr Arg Ala Leu He Ala Asp Gly His Asp Val Phe Leu 

260 265 270 

Glu Thr Ser Pro His Pro Met Leu Ala Val Ala Leu Glu Gin Thr Val 

275 280 285 

Thr Asp Ala Gly Thr Asp Ala Ala Val Leu Gly Thr Leu Arg Arg Arg 

290 295 300 

His Gly Gly Pro Arg Ala Leu Ala Leu Ala Val Cys Arg Ala Phe Ala 
305 310 315 320 

His Gly Val Glu Val Asp Pro Glu Ala Val Phe Gly Pro Gly Ala Arg 

325 330 335 

Pro Val Glu Leu Pro Thr Tyr Pro 
340 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Arg Ala Pro Ala Phe Val Phe Pro Gly Gin Gly Ala Gin Trp Ala Gly 

1 5 10 15 

Leu Gly Ala Arg Leu Leu Ala Asp Ser Pro Val Phe Arg Ala Arg Ala 



BNSOOCID: <WO ^9851695A^!_> 



wo 98/51695 



PCTAJS98/09518 



89 



20 

Glu Ala Cys Ala 

35 

Asp Val Leu Ala 
50 

Val Val Gin Pro 
65 

Trp Glu Ala His 

Gly Glu Val Ala 
100 

Ala Ala Leu Val 
115 

Gly Asn Gly Gly 
130 

Glu Leu Leu Glu 

145 

Gly Pro Ala Ser 

Phe Gly Ala Arg 
180 

Gly Val Asp Phe 
195 

Glu Leu Leu Asp 

210 

Pro Phe Phe Ser 
225 

Asp Gly Ala Tyr 

Ala Thr Val Arg 
260 

Val Gly Pro His 
275 

Asp Glu Gly Val 
290 

Gly Gly Leu Asp 
305 

Gly Val Arg Val 

Arg Val Pro Leu 
340 



Arg Ala Leu Glu 
40 

Gly Ala Pro Gly 
55 

Val Leu Phe Thr 
70 

Gly Val Arg Pro 

85 

Ala Ala Cys Val 

lie Ala Gly Arg 
120 

Met Leu Ala Val 
135 

Pro Trp Arg Gin 

150 

Val Thr Val Ser 
165 

Leu Ser Ala Glu 

Ala Gly His Ser 
200 

Leu Leu Ser Gly 
215 

Thr Val Thr Ala 
230 

Trp Tyr Arg Asn 
245 

Ala Leu Leu Arg 

Pro Leu Leu Asn 

280 

Ala Ala Thr Ala 
295 

Arg Val Arg Asn 
310 

Asp Trp Asn Ala 

325 

Pro Ser Tyr Ala 



25 

Pro His Leu Asp 

Thr Pro Pro lie 
60 

Thr Met Val Ser 
75 

Ala Ala Val Val 
90 

Ala Gly Ala Leu 
105 

Ser Arg Leu Trp 

Met Ala Pro Ala 
140 

Arg lie Ser Val 

155 

Gly Asp Ala Leu 
170 

Gly Val Leu Arg 
185 

Pro Gin Val Glu 

Val Arg Pro Ala 
220 

Gly Pro Cys Gly 
235 

Thr Arg Glu Pro 
250 

Ala Gly His His 
265 

Ala Ala lie Asp 

Leu His Thr Leu 
300 

Ala Val Gly Ala 
315 

Leu Phe Glu Gly 
330 

Phe 
345 



30 

Trp Ser Val Leu 

45 

Asp Arg Ala Asp 

Leu Ala Ala Leu 
80 

Gly His Ser Gin 
95 

Ser Leu Asp Asp 
110 

Gly Arg Leu Ala 
125 

Glu Arg He Arg 

Ala Ala Val Asn 
160 

Ala Leu Glu Glu 
175 

Trp Pro Leu Pro 
190 

Glu Phe Arg Ala 
205 

Pro Ser Arg He 

Gly Asp Gin Leu 
240 

Val Glu Phe Asp 
255 

Thr Phe He Glu 
270 

Glu He Ala Ala 

285 

Gin Arg Gly Ala 

Ala Phe Ala His 
320 

Thr Gly Ala Arg 
335 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
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TTTGAATTCA CGTCCTCGAC GTGCAGCA 



28 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TTTGGATCCC CTAGGGGACG GCCGGGCCAC GCC 33 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TTTGGATCCA TGCATCTGCC GGAGTTCGCC CCG 33 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
TTTAAGCTTG CGCCCGCCCG TTGGGC 26 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
ATGGCTTCCG ACAGTCCCCG CCCAAGGCCG 30 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
ACCAATTCCG TCGGCGGGCA CCAGGCCACC 30 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 41 : 
TTTTGAATTC CCTAGGATGT CACGCGCGGA ACTGG 3 5 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TTTTGCATGC GTCAGTGCGA GCCG 24 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

TTTTCTCGAG GTCGGCCCGG AAGT 24 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TTTTAAGCTT ATGCATGTCG AGTCGCCGGG GAATGG 36 
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What is claimed is: 

1 . A compound of the fomiula: 



O 




X 

wherein 

Rh R3. R4> R55 and R6 are independently selected from Q wherein Q is selected 

from the group consisting of (a) -H, (b) -Me, (c) -Et, and (d) -OH; 

R7 is selected from the group consisting of -Et, -HOMe (hydroxymethyl), and 3,4- 

dihydroxycyclohexylmethyl; 

Li and L2 are independently -H or -OH; 

L3 is D-desosamine or -OH; and 

L4 is L-mycarose, L-cladinose or -OH 
with the proviso that when R7 is -Et and R1-R5 are -Me, R6 is other than -H or -Me 

2. The compound of claim 1 wherein Q is selected from the group consisting of (a), (b), 
and (c), R7 is -Et and Li, L2, L3 and L4 are as defined therein. 

3. The compound of claim 1 wherein Q is selected from the group consisting of (a), (b), 
and (d), R7 is -Et and Li , L2, L3 and L4 are as defined therein. 

4. The compound of claim 1 wherein Q is selected from the group consisting of (a), (c), 
and (d), R7 is -Et and Li , L2, L3 and L4 are as defined therein. 
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5. The compound of claim 1 wherein Q is selected from the group consisting of (b), (c), 
and (d), R7 is -Et and Li, L2, L3 and L4 are as defined therein. 



6. The compound of claim 1 wherein 

(a) R6 and R\ are -H and R2, R3, R4 and R5 are -Me, 

(b) R5 and Ri are -H and R2, R3, R4 and are -Me, 

(c) R4 and Ri are -H and R2, R3, R5 and Re are -Me, 

(d) R3 and Ri are -H and R2, R4, R5 and R6 are -Me, 

(e) R2 and R| are -H and R3, R4, R5 and R6 are -Me, 

(f) Rs and R2 are -H and R] , R3, R4 and R5 are -Me, 

(g) R5 and R2 are -H and Ri , R3, R4 and Rg are -Me, 

(h) R4 and R2 are -H and R], R3, R5 and R^ are -Me, 

(i) R3 and R2 are -H and Ri , R4, R5 and R^ are -Me, 
(j) R6 and R3 are -H and Rj , R2, R4 and R5 are -Me, 
(k) R5 and R3 are -H and Ri, R2, R4 and Re are -Me, 
(1) R4 and R3 are -H and Ri, R2, R5 and Re are -Me, 
(m) Re and R4 are -H and Ri , R2, R3 and R5 are -Me, 
(n) R5 and R4 are -H and R] , R2, R3 and R6 are -Me, or 
(o) Re and R5 are -H and R\ , R2, R3 and R4 are -Me; 
R7 is -Et; and Li, L2, L3 and L4 are as defined therein. 

7. The compound of claim 6 wherein (a)-(o) and R7 are as defined therein, Lj and L2 are 
-OH, L3 is D-desosamine and L4 is L-cladinose. 



8. The compound of claim 1 wherein 

(a) R6, R2 and R\ are -H and R3, R4 and R5 are -Me, 

(b) R5, R2 and R\ are -H and R3, R4 and Re are -Me, 

(c) R4, R2 and R] are -H and R3, R5 and Re are -Me, 

(d) R3, R2 and R] are -H and R4, R5 and Re are -Me, 

(e) R6, R3 and R\ are -H and R2, R4 and R5 are -Me, 

(f) R5, R3 and R] are -H and R2, R4 and Re are -Me, 

(g) R4, R3 and Ri are -H and R2, R5 and Re are -Me, 

(h) R6, R4 and R\ are -H and R2, R3 and R5 are -Me, 

(i) R5, R4 and Ri are -H and R2, R3 and Re are -Me, 
(j) R^, R5 and Ri are -H and R2, R3 and R4 are -Me, 
(k) R6, R3 and R2 are -H and Ri, R4 and R5 are -Me, 
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(1) R5, R3 and R2 are -H and Ri , R4 and Rs are -Me, 

(m) R4, R3 and R2 are -H and Ri, R5 and Re are -Me, 

(n) Re, R4 and R2 are -H and R], R3 and R5 are -Me, 

(o) R5, R4 and R2 are -H and R] , R3 and R^ are -Me, 

(p) Re, R5 and R2 are -H and R], R3 and R4 are -Me, 

(q) Re, R4 and R3 are -H and Ri , R2 and R5 are -Me, 

(r) R5, R4 and R3 are -H and R], R2 and Re are -Me, 

(s) Re, R5 and R3 are -H and Ri, R2 and R4 are -Me, or 

(t) Re, R5 and R4 are -H and Ri , R2 and R3 are -Me; 
R7 is -Et and Lj, L2, L3 and L4 are as defined therein. 

9. The compound of claim 8 wherein (a)-(t) and R7 are as defined therein, Li and L2 are 
-OH, L3 is D-desosamine and L4 is L-cladinose. 

10. The compound of claim 1 wherein 

(a) Re, R3, R2 and R] are -H and R5, and R4 are -Me, 

(b) R5, R3, R2 and Rj are -H and Re, and R4 are -Me, 

(c) R4, R3, R2 and R] are -H and R5, and Re are -Me, 

(d) Re, R4, R2 and R\ are -H and R3, and R5 are -Me, 

(e) R5, R4, R2 and R] are -H and R3, and Re are -Me, 

(f) Re, R5, R2 and R\ are -H and R3, and R4 are -Me, 

(g) Re, R4, R3 and R\ are -H and R2, and R5 are -Me, 

(h) R5, R4, R3 and R] are -H and R2, and Re are -Me, 

(i) Re, R5, R4 and Rj are -H and R2, and R3 are -Me, 
(j) R2, R4, R3 and R\ are -H and R5, and Re are -Me, 
(k) Re, R4, R3 and R2 are -H and Ri, and R5 are -Me, 
(I) R5, R4, R3 and R2 are -H and Ri, and Re are -Me, 
(m) Re, R5, R3 and R2 are -H and Ri, and R4 are -Me, or 
(n) Re, R5, R4 and R3 are -H and Ri , and R2 are -Me; 

R7 is -Et and Li, L2, L3 and L4 are as defined therein. 

1 1 . The compound of claim 1 0 wherein (a)-(n) and R7 are as defined therein, Li and L2 
are -OH, L3 is D-desosamine and L4 is L-cladinose. 

12. The compoimd of claim 1 wherein 

(a) R5, R4, R3, R2 and R\ are -H and R^ is -Me, 

(b) Re, R4, R3, R2 and Ri are -H and R5 is -Me, 
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(c) R6, R5, R3, R2 and Ri are -H and R4 is -Me, 

(d) R6, R5, R4, R2 and R\ are -H and R3 is -Me, 

(e) R6, R5, R4, R3 and Ri are -H and R2 is -Me, or 

(f) R6, R5, R4, R3 and R2 are -H and R\ is -Me; 
R7 is -Et and Li, L2, L3 and L4 are as defined therein. 

13. The compound of claim 12 wherein (a)-(f) and R7 are as defined therein, Li and L2 
are -OH, L3 is D-desosamine and L4 is L-cladinose. 

14. The compound of claim 1 wherein Ri, R2, R3, R4, R5 and Re are -H and R?, Li, L2, 
L3 and L4 are 21s defined therein. 

15. The compound of claim 14 wherein Ri, R2, R3» R4, Rs, R6 and R? are as defined 
therein, Li and L2 are -OH, L3 is D-desosamine and L4 is L-cladinose. 

16. The compound of claim 1 selected from the group consisting of 6,10-didesmethyl-6- 
ethylerythromycin A; 10,12-didesmethyl-12-deoxy-12-ethylerythromycin A; 10,12- 
didesmethyl-12-deoxy-lO-hydroxyerythromycin A; 6,10,12-tridesmethyl-6,12- 
diethylerythromycin A, and 6,10,12-tridesmethyl-6-deoxy-6,12-diethylerythromycin A. 

17. The compoxmd of cleum 1 selected from the group consisting of 10- 
desmethylerythronolide B, 1 0-desmethyl-6-deoxyerythronolide B, 12-desmethylerythronolide 
B, 12-desmethyl-6-deoxyerythronoHde B, 12-desmethyl-12-ethylerythronolide B, 6- 
desmethyl-6-deoxy-6-ethylerythronoiide B, 1 0-desmethylerythromycin A, lO-desmethyl-12- 
deoxy erythromycin A, 10-desmethy 1-6, 12-dideoxy erythromycin A, 12- 
desmethylerythromycin A, 12-desmethyl-12-deoxy erythromycin A, 12-desmethyl-6,12- 
dideoxyerythromycin A, 6-desmethyl-6-ethylerythromycin A, 12-desmethyl-12- 

ethy Erythromycin A, 12-desmethyl-12-deoxy-12-ethy Erythromycin A, 10-desmethyl-lO- 
hydroxy erythromycin A, 12-desmethyl-12-epihydroxy erythromycin A, 10,12- 
didesmethylerythromycin A, 10,12-didesmethyl-12-deoxy erythromycin A, and 10,12- 
didesmethyl-6,12-dideoxyerythromycin A. 

1 8. The compound of claim 1 selected from the group consisting of 1 0- 
desmethylerythronolide B, lO-desmethyl-6-deoxyerythronolide B, 12-desmethylerythronolide 
B, 12-desmethyl-6-deoxyerythronolideB, lO-desmethylerythromycin A, lO-desmethyl-12- 
deoxyerythromycin A, 1 0-desmethy 1-6, 1 2-dideoxyerythromycin A, 1 2- 
desmethylerythromycin A, 12-desmethyl-12-deoxyerythromycin A, 12-desmethyl-6,12- 
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dideoxyerythromycin A, 10J2-didesmethylerythromycin A, 10,12-didesmethyH2- 
deoxyerythromycin A, and 10,12-didesmethyl-6,12-dideoxyerythromycin A. 

19. A compound selected from the group consisting of 10-desmethylerythromycin A, 10- 
desmethyl-12-deoxyerythromycin A, and 12-desmethyl-12-deoxy erythromycin A. 

20. The compound of claim 1 selected from the group consisting of 8-desmethyl-8- 
hydroxyerythromycin A, 6-desmethyl-6-epierythromycin A, 4-desmethyl-4- 
hydroxy erythromycin A, 2-desmethyl-2-hydroxyerythromycin A and 13-desethyi-13- 
hydroxymethol erythromycin A. 

21. The compound of claim 1 selected from the group consisting of 2,12-didesmethyl- 
2, 1 2-dihydroxyerythromycin, 4, 1 0-didesmethy 1-4, 1 0-dihydroxyerythromycin, 10,12- 
didesmethyl-lO-hydroxyerythromycin, and 6,10-didesmethyl-6-ethyl-10- 
hydroxyerythromycin A. 

22. The compound of claim 1 which is 13-desethyl-13-(3%4'- 
dihydroxycyclohexyl)methylerythromycin A. 

23. An isolated polynucleotide sequence or fragment thereof which encodes an 
enzymatically active acyltransferase domain from a polyketide-producing microorganism 
selected from the group consisting of Streptomyces hygroscopicus^ Streptomyces venezuelae^ 
and Streptomyces caelestis, 

24. The polynucleotide of Claim 23 selected from the group consisting of SEQ ID NO:l, 
SEQ ID NO:2, SEQ ID NO:29 and SEQ ID NO:30. 

25. The polynucleotide of Claim 23 wherein said acyltransfereise domain is selected from 
the group consisting of SEQ ID NO:3 1 , SEQ ID NO:32, SEQ ID NO:33 and SEQ ID NO:34. 

26. A vector comprising a polynucleotide sequence or fragment thereof which encodes an 
enzymatically active acyltransferase domain from Streptomyces. 

27. The vector of Claim 26 wherein said Streptomyces is selected from the group 
consisting of Streptomyces hygroscopicuSy Streptomyces venezuelae^ and Streptomyces 
caelestis. 
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28. The vector of Claim 26 wherein said polynucleotide is selected from the group 
consisting of SEQ ID N0:1, SEQ ID N0:2, SEQ ID NO:29 and SEQ ID NO:30. 

29. The vector of Claim 26 wherein said acyltransferase domain is selected from the 
group consisting of SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33 and SEQ ID NO:34. 

30. A vector selected from the group consisting of pUCl 8/LigAT2, pEryATl/LigAT2, 
pEryAT2/LigAT2, pUC18/venAT, pEryATl/venAT, pUC19/rapAT14, pEryATl/rapATH, 
pEryAT2/rapAT14, pUC/5*-flank/ethAT, pUC/ethAT/C-6, pEAT4, pUC18/NidAT6, 
pEryAT2/NidAT6, pEryATs/NidAT6, and pEryATs/rapligase 3.O.. 

31. A host cell transformed with the vector of Claim 32. 

32. The host cell of Claim 3 1 wherein said cell is a bacterial cell or a polyketide- 
producing microorganism. 

33. The host cell of Claim 32 wherein said polyketide-producing microorganism is 
selected from the group consisting of Saccharopolyspora species and Streptomyces species. 

34. The host cell of Claim 33 wherein said polyketide-producing microorganism is 
Saccharopolyspora erythraea. 

35. A method for altering the substrate specificity of a polyketide synthase in a first 
polyketide-producing microorganism comprising the steps of: 

(a) isolating a fu-st and second genomic DNA segment, each comprising a 
polyketide synthase wherein said first genomic DNA segment is fi:-om said first polyketide- 
producing microorganism and said second genomic DNA segment is from said first 
polyketide-producing microorganism or a second polyketide-producing microorganism; 

(b) identifying one or more discrete fragments of said first genomic DNA 
segment, each of which encodes an acyltransferase domain; 

(c) identifying one or more discrete fi-agments of said second genomic DNA 
segment, each of which encodes a related domain to said acyltransferase domain of said first 
genomic DNA segment; and 

(d) transforming a cell of said first polyketide-producing microorganism with one 
or more of said fragnients fi-om step (c) under conditions suitable for the occurrence of a 
homologous recombination event, leading to the replacement of one or more of said 



BNSDOCID: <WO ^9851695A2^L! 



wo 98/51695 



PCT/US98/09518 



99 

fragments from said first genomic DNA segment with one or more of said fragments from 
step (c). 

36. The method of Claim 35 wherein said first polyketide-producing microorganism is 
Saccharopolyspora erythraea, 

37. The method of Claim 35 wherein said second polyketide-producing microorganism is 
Streptomyces. 

38. The method of Claim 35 wherein said second polyketide-producing microorganism is 
Saccharopolyspora erythraea. 

39. The method of Claim 35 wherein said related domain is selected from the group 
consisting of SEQ ID NO:3 1 , SEQ ID NO:32, SEQ ID NO:33 and SEQ ID NO:34. 
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GGGCCGCTGGCGGTGATGTTCACCGGACAGGGCTCCCAACGCCCCGGCATGGGACGACAG 60 
GPLAVMFTGQGSQRPGMGRQ 20 

TTGTACGAGCACTTCCCCGTCTSCGCCCAGGCACTGGACGAGGTCTTCGCACTCGCCACC 120 
LYEHFPVFAQALDEVFALAT 40 

CCCGGACTACGCGAGGTGATGTTCGACCCCGACCAGGCCGAAACACTCC AACGCACCGAC 1 80 
PGLREVMFDPDQAETLQRTD 60 

CAC6CCCAGATCGCCCTGTTCGCCTTCGAAACCGCCCTCTACCGACTCTGGGAATCCTGG 240 
HAQIALFAFETALYRLWESW 80 

GGCCTGCGACCCGACATGGTCTGCGGACACTCGGTCGGA6AAATCACCGCAGCCCACGTC 300 
GLRPDMVCGHSVGEITAAHV 100 

TCC6GCACCCTCACCCTCCCCGACGCCGTCCACCTCGTCACCACACGCGGCACCCTCATG 360 
SGTLTLPDAVHLVTTRGTLM 120 

CAAAACCTGCCCCCCGGCGGCGCCATGCTCGCCGTCGCCACCGACCCCCACACCCTCCAi-. 420 
QNLPPGGAMLAVATDPHTLQ 140 

CCCCACCTCGACAACCACCACGACACCATCTCCATCGCCGCCATCAACGGCCCCCAC6CC 480 
PHLDNHHDTISIAAINGPHA 160 

ACCGTCCTCTCCGGCGACCGCACCACCCTCCACCACATCGCCACCCAACTCAACACCAAA 540 
TVLSGDRTTLHHIATQLNTK 180 

ACCAACTG6CTCAACGTCAGCCACGCCTTCCACTCCCCCCTCATGCAACCCATCCTCCAA 600 
T NWLNVSHAFHSPLMQPILQ 200 

CCCTTCACCACCACCCTCAACACCCTCACCCACCACCCCCCACACACACCCCTCATCAGC 660 
PFTTTLNTLTHHPPHTPLIS 220 

ATGCTCACCGCCACACCCACCCACCCC6ACACCACCCACTGGACCCAGCACATCACC6CA 720 
MLTATPTHPDTTHWTQHITA 240 

CCCGTCCGCTACACCGACACCCTCCACCACCTCCACCACCACGGCATCACCACCTACCTC 780 
PVRYTDTLHHLHHHGITTYL 260 

6AAATCG6CCCCGACACCACCCTCACCGCCCTCGCCCGCACCACCCTCCCCACCACCACC 840 
EIGPDTTLTALARTTLPTTT 280 

CACaCATCCCCACCACCCGCCGCAACCACAACGAAGTCCGCAGCACGAACGAGGCGTTG 900 
H L IPTTRRNHNEVRSTNEAL 300 

GGCAGGGTGTTCAGCGTGG6CCACTCGGTGGACTGGCGGGCCCTCACTCCGACCGG6AGG 960 
GRVFSV6HSVDWRALTPTGR 320 

CGTACCTCCCTGCCGACGTACCCCT ^985 
RTSLPTYP 328 
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CCTAGGACGGCA6TCCTGCTCACCGGGCAGGGTTCCCAGC6TCAGGGCATGGGGCGCGAA 60 
PRTAVLLT6QGSQRQGMGRE 20 

CTGTAC6ACCGGTCACCGGTGTTC6CCGCCTCGTTCGACGCGATCTGCGCTCAACTCGAC 120 
LYDRSPVFAASFDAICAQLD 40 

GGGCAACTGCCTCGTCCCCTCAAGGACGTTCTCTTCGCCCCCGAGGGGTCGGAGGACGCC 180 
GQLPRPLKDVLFAPE6SEDA 60 

GCGCTCASC6ACCGTACGGTGTTCACACA6GCGGCTCTGTTCGCCGTGGAGACCTCCCTG 240 
ALIDRTVFTQAALFAVETSL 80 

TTCCGGCTGTTCGAGGCCCACG6CCTCGSCCCCGACTACCTCASCGGCCACTCCATCGGC 300 
FRLFEAHGLVPDYLIGHSIG 100 

GAAGTGACCGCGGCCCGCCTGGCCGGGGTCCTCGATCTGGCGGACGCGTGCGTCCTGGTC 360 
EVTAAHLAGVLDLADACVLV 120 

GCCCACCGCGGCCGCCTGATGCAGTCGGCCCGG6CCGGCGGCGCGATGGCCGC6GTCCAG 420 
AHRGRLMQSARA6GAMAAVQ 140 

GCGAGCGAGGACGAGGTACGCGAGGCCCTCGCGACCTTCGACGATGCGGTTCCCGTGGCC 480 
ASEDEVREALATFDDAVAVA 160 

GGAGTCAACGGCCCGAACGCCACCGTCGTCTCCGGCGACGAGGACGCGGTCGAGCGGCTG 540 
GVNGPNATVVS6DEDAVERL 180 

GTCGCGCGCTGGCGCGAGCAGGGCAGGCGGAC6AAGCGGCTGCCGGTCAGCCACGCCTTC 600 
VARWREQGRRTKRLPVSHAF 200 

CACTCGCCGCACATGGACGGGATCGTCGACGAGTTCGTCACCGCCGTCTCCGGGCTCACC 660 
HSPHMIGIVDEFVTAVSGLT 220 

TTCCGCTCCCCGACGLTCCCGGTCGTCTCCAACGTCACCGGGACCCTCGCCACCGTCGAC 720 
FRSPTIPVVSNVTGTLATVD 240 

CACCTGACCTCGCCCGCGTACTGGGCACGCCACATCCGCGAGGCCGTGCGCTTCGCCGAC 780 
QLTSPAYWARHIREAVRFAD 260 

GGGGTGCGGTACCTGGAGGGCGAGGGCGTCACCGAATGGCTGGAGCTCGGGCCCGACGGC 840 
GVRYLEGEGVTEWLELGPDG 230 

GTTCTCGTC6CCCT66TCGAG6ACTGCCTGGCGAAGGAGGCGGGATCGCTCGCGTCCGCC 900 
VLVALVEDCLAKEA6SLASA 300 

CTGCGCAAGGGGGCGAGCGAGCCCCACACCGTGGGCGCGGCCATGGCCCGCGCGGTGCTG 960 
LRKGASEPH TVGAAMARAVL 320 

CGCGGATCCGGCCCCGACTGGGCGGCGGTGTTCCCCG6CGCACGGCGGGTCGACCTTCCG 1020 
R6SGPDWAAVFPGARRVDLP 340 

ACGTATGCAT 1030 
T Y A 343 
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ISOLATE Avr\l/Nsi\ rapATU FRAGMENT AND CLONE IT INTO 
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EcoRl-Avrll' 



■1023 bp- 




Nsil -BamHl 



EcoRl 



BOIATE Avrll/Nsil rapATU FRAGMENT AND CLONE IT INTO 
AvrU/m SITES OF THE pCS5/AT2-FlANK 

16686 . 17853 18864 , 19955 

-JAvrll-BamHl-Nsil\ — \HindHl 



16686 



fcoRI 



-1 kb 



Avrll 
17853 
1 




Nsil 
18864 



~1 kb 



5' FLANKING REGION rapATU DOMAIN 3' FLANKING REGION, 



19955 
\Hindm 
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GCCGACCGTGTCGTGTTCGTGTTCCCCGGCCAGGGCTCGCAGTGGGCCGGAATGGCCGAG 60 
ADRVVFVFPGQGSQWAGMAE 20 

GGGCTGCTGGAGCGGTCCGGCGCGTTCCGGAGTGCGGCCGACTCGT6C6AC6CC6CGCTG 120 
GLLERSGAFRSAADSCDAAL40 

CGGCCGTACCTCGGCTGCTCGGT6CTGAGCGTGCT6CGCGG6GAACCGGAC6CGCCCTCG 180 
RPYLGWSVLSVLRGEPDAPS60 

CTCGACCGGGTCGACGTC6TGCAGCCGGTGCTGTTCACGATGATGGTCTCGCTCGCGGCG 240 
LDRVDVVQPVLFTMMVSLAA80 

GTCTGGCGTGCGCTGGGCGTGGAACCGGCGGCGGTCGTCGGGCACTCGCAGGGTGAGATC 300 
VWRALGVEPAAVVGHSQGEIlOO 

GCCGCTGCCCATGTCGCCGGTGCGCTGTCGCTGGACGACTC6GCCCGGATCGTCGCCCTG 360 
AAAHVAGALSLDDSARI VAL120 

CGCAGTCGGGCGTGGCTCGGACTGGCGG6CAAGGGCGGCATGGTGGCGGTGCCGATGCC6 420 
RSRAWLGLA6KGGMVAVPMP140 

GCGGAGGAGCTGCGGCCGCGGCTGGTGACGTGGGGGGACCGTCTGGCCGTCGCCGCC6TC 480 
AEELRPRLVTWG DRLAVAAV160 

AACAGCCCCGGTTCCTGCGCCGTCGCAGGCGACCCGGAGGCGCTG6CCGAACTGGTGGCG 540 
NSP6SCAVAGDPEALAELVA180 

CTGCTGACCGGTGAGGGGGTGCACGCCCGGCCGATCCCCGGCGTCGACACGGCGGGCCAC 600 
LLTGEGVHARPIPGVDTAGH200 

TCGCCGCAGGTGGACGCGTTGCGGGCTCATCTGCTGGAGGTGCTGGCCCCGGTCGCCCCC 660 
SPQVDALRAHLLEVLAPVAP220 

CGACCGGCCGACATCCCGTTCTACTCGACG6TGACCGGCGGGCTGCTGGACGGCACC6A6 720 
RPADTPFYSTVTGGLLDGTE240 

CTGGACGCGACGTACTGGTACCGCAACATGCGC6AGCCCGTCGAGTTCGAGCGGGCCACA 780 
LDATYWY RNMREPVEF ERAT260 

CGGGC6CTGATCGCCGACG6GCACGACGTCTTCCTGGAGACGAGCCCGCATCCCATGCTG 840 
RAL IADGHDVFLETSPHPML280 

GCCGTGGCGCTGGAGCAGACGGTCACCGACGCCGGCACCGACGCGGCGGTGCTCGGGACC 900 
AVALEQTVTDAGTDAAVLGT300 

CTGCGCCGCCGCCAC66CGGTCCTCGCGCGCT6GCCCTGGCCGTCTGCC6CGCCTTCGCG 960 
LRRRHGG PRALALAVCRAFA320 

CACGGCGTGGAGGTGGACCCCGAGGCGGTCTTCGGTCCGGGCGCACGGCCCGTGGAGTTG 1020 
HGVEVDPEAVFGPGARPVEL340 

CCCACCTATCCG 1032 
P T Y P 344 
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PROTBN SEQUENCE S A P R K P 

ORIGINAL SEQUENCE TCCGCGCCGCGCAAGCCG 
ALTERED SEQUENCE TCCGCGCCTAGG AAGCCG 

I I 

Avrll SITE 



PCR OUGOS FOR 5' -FlANK>lwII SITE 



5' - FLANK SEQUENCE 



N-TERMINAL OUGO 5' -GAGAGAGGAACCAACGCGCACGTGATCGTCGAAGAGGCACCAGC 
(SEQ. ID. NO. 21) 

■ ^ 5' -FLANK SEQUENCE 

C-TERMINAL OUGO 5'-GAGAG AGGATCC G ACCTAGG CGCGGAGGTCACCGGCGCGACGGCG 

(SEQ. ID. NO. 22) BamHl SITE Avrll SITE 



PCR OUGOS FOR NidATS FRAGMENT 

, BEGINNING OF NidATS 

><-TERMINAL OUGO 5'-GAGAGACCTAGGAAGCCGGTGTTCGTGTTCCCCGGCCAGGGCT 

(SEQ. ID. NO. 23) ^j;^^ 

, 3' END OF NidATS 

C-TERMINAL OUGO S'-GAGAG AGGATCC Gy ^GCCGGCCG TGCGCCCGGACCGAAGACCGCCTC 

(SEQ. ID. NO. 24) Bamtil SITE Fsel SITE 



FIG.23 
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Xhol 




Fsel 
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CGCGCGCCTGCCTTCGTCTTTCCCGGGCAGGGCGCCCAGTGGGCCGGACTGGGAGCGCGG 60 
RAPAFVFPGQ6AQWA6LGAR20 

CTCCTCGCGGACTCCCCCGTCTTCCGCGCCAGGGCCGAGGCATGCGCGCGGGCGCTG6AG 120 
LLADSPVFRARAEACARALE40 

CCTCACCTCGACTGGTCG6TCCTC6ACGTGCTGGCCG6CGCCCCGGGCACCCCTCCCATC 180 
PHLDWSVLDVLAGAPGTPPI60 

GACCGGGCCGACGTGGTGCAGCCGGTGCTGTTCACCACGATGGTCTCGCTGGCCGCCCTC 240 
DRADVVQPVLFTTMVSLAAL80 

TGGGAGGCCCACGGGGTGCG6CCG6CCGCGGTC6T6GGCCACTCCCA6GGCGAGGTGGCC 300 
W EAHGVRPAAVVGHSQGEVAIOO 

GCGGCCTGCGTGGCCGGT6CCCTGTCGCTGGACGACGCTGCCCTGGTGATCGCCGGACGC 360 
AACVAGALSLDDAALVIAGR120 

AGCAGGCTGTGGGGGCGGCTGGCCGGGAACGGCGGGATGCTCGCGGTGATGGCTCCGGCC 420 
SRLWGRLAGNG6MLAVMAPA140 

GAGCGGATCCGT6AGCT6CTCGAACCATGGCGGCAGCGGATTTCGGTGGCGGCGGTCAAT 480 
ERIRELLEPWRQRISVAAVN160 

GGCCCCGCCTCGGTCACCGTCTCCGGTGAC6CGCTCGCGCTGGAGGAGTTCGGCGCGCGG 540 
GPASVTVSGDALALEEFGARIBO 

CTCTCCGCCGAGGGGGTGCTGCGCTGGCCGCTGCCGGGCGTCGACTTCGCC6GCCACTCG 600 
LSAEGVLRWPLPGVDFAGHS. 200 

CCGCAGGTGGAGGAGTTCC GC5CTGAGCTCCTGGACCTGCTCTCCGGCGTACGGCCGGC 660 
PQVEEFRAELLDLLSGVRPA220 

CCTTCGCGGATACCTTTCTTCTCCACCGTGACGGCGGGTCCTTGCGGCGGCGACCAGCTG 720 
PSR1PFPSTVTAGPCGGDQL240 

GACGGGGCGTACTGGTACCGCAACACGCGCGAACCCGTGGAGTTCGACGCCACGGTCCGG 780 
DGAYWYRNTREPVEFDATVR260 

GCGCTGCTGCGTGC6GGCCATCACACGTTCATCGAGGTCGGTCCGCATCCGCTGCTCAAC 840 
ALLRAGHHTFIEVGPHPLLN280 

GCCGCGATCGACGAGATC6CAGCGGACGAGGGGGTAGCGGCCACG6CCCT6CATACGCTC 900 
AAIDEIAADE6VAATALHTL300 

CAGC6GG6CGCTGGC6GCCTTGACCGCGTGCGCAAC6CG6TGG6CGCCGCTTTCGCGCAC 960 
QR6A6GLDRVRNAVGAAFAH320 

GGTGTCCGGGTCGACTGGAACGCCCTGTTCGAGGGCACCGGTGCGCGCAGGGTGCCGCTT 1020 
GVRVDWNALFEGTGARRVPL340 

CCCTCGTACGCCTTC 1035 
P S Y A F 345 
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PGR OLIGOS: 

Avrll 
I 1 

N-TERMINAL OLIGO: 5' fcoRITaq-CCTAGGGTCGCCTTCGTCTrTCCCGGGCAGG-3 

GCGC CCT 

I AND Vol CQDON I HOMOLOGOUS REGION 
Nsil 



I I 

C-TERMINAL OUGO: 5' 5^/11 Tag-ATGCATACGAGGGAAGCGGCACCCTGC-3 

G G 

I ENGINEEREDA/s/I I HOMOLOGOUS REGION 



PGR CLONING: 
NIDDAMYCIN CLUSTER 



NidAT6 DOMAIN 



PGR NidAT6 DOMAIN WITH ENGINEERED OUGOS 



-EcoRl-Avrll- 



1024 bp- 



Nsil -BglU-y 



NidAT6 DOMAIN 

CLONED INTO pUC18 Eco Rl /Bam HI SHES 
AND SEQUENCES RDELITY CONFIRMED 



£coRI ->1i^rII 




Nsil- Bgl 11/ BamHl 

(CLONED NidATB DOMAIN WITH 
INTRODUCED Avrll /Nsil SITES) 



FIG.26 



SUBSTITUTE SHEET (RULE 26) 



BNSCnCID: <WO ^e8Sie96A2_L> 



wo 98/51695 



PCT/US98/09518 



31/36 



1024 bp 




ISOIATE /1w-II/Vs/INidAT6 FRAGMENT AND CLONE fT INTO 
AvrW/Nsil SITES OF THE pCS5/AT2-Fl>NK 
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Ery AI 



ery ATs DOiylAIN 
5' — 3' 



5' FLANKING REGION WITH 
ENGINEERED AvrW SITE AT 3' END 



pQp \ 3' FLANKING REGION WITH 



ENGINEERED A&/I SITE AT 5' END 



-1.2 kb 



902 



5' FLANKING REGION 



/1wII-5fl/77HI3' 



CLONED IN pUC18 fcoRI/flo/wHI 
AND SEQUENCES RDEUTY CONRRMED 



1908 . - 

S'fe/wHI-Afe/ll ^ \Hm<mi- 3' 
3' FIANKING REGION 



CLONED IN pUmBamm/HindlU 
SITES AND SEQUENCE CONRRMED 



CLONE 5' FLANK REGION INTO pCS5 EcoRl /BamHl SITES. GENERATING 
pCS5/ATs5' -FLANK. THEN CLONE 3' FLANK REGION INTO flo/nHI ////>?</ II I 
SITES OF THE pCS5/ATS/5' FLANK. RESULTING IN pCS5/ATs-FLANK. 
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EcoRl-Avrll 




Nsil-BamHl 



ISOLATE /l(rII/V5/INidAT6 FRAGMENT AND CLONE 
IT INTO >lw-II/A&/I SITES OF THE pCS5/ATs-FLANK 
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ISOIATE /Iw-II/Afe/'INidATB FRAGMEMT AND CLONE 
IT INTO SITES OF THE pCSS/ATl-RANK 
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COSMID |2 
RapP 



RapA 



ropp I I rop UGASE rap ERS ID rop ACPs | rap KSI 



Eco RI 



Sphl 
0.11 kbl 



2.1 kb 



Xho\ 

I 0.77 kb ! 



Avrll 



I PCR CLONING 
pSL1 180/0.11 



Hindm 
Afe/I 



PREPARED 2.1 kb o.«.on/n-F-, 
Sphl/Xhol FRAGMENT pSL1180/0.77 



SUBCLONING 2.1 kb FRAGMENT 
INTO pSLl 1800/0.11 



PREPARED 0.77 kb,^^^ 
Xhol/ Hindm FRAGMENT 



pSL1180/0.1 1/2.1 



SUBCLONING 0.77 kb FRAGMENT 
INTO pSL1 180/0.1 1/2.1 

Avrll Sphl ^ Xhol Ate/ 1 
Eco RI 1 0.11 kbl 2.1 kb 0.77 kb ^ Hindm 
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EcoRl-Avrll- 



3.0 kb — 




■Ns/l -BamHl 




pSLl180/rapligase 3.0 




902 



1908 



1 9 \th """^ ~1 2 kh 

fcoRI L l/l>rII-a7mHI-Afe/lJ — \Hinm 



5' FLANKING REGION 



3' FLANKING REGION 



ISOLATE Awil/Nsil rapligase 3.0 FRAGMENT 
AND CLONE IT INTO Awll/Nsil SITES OF THE^ 
pCS5/ATs-FLANK 




fcoRI 



-1.2 kb 



Avrll 
902 



Nsil 
1908 



-1.2 kb 



J////7£/III 



5' FLANKING REGION rapligase 3.0 





pEryATs/rapligase 3.0 
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